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1 About This Data Sheet 

This data sheet provides a technical overview of the Alpha 21164 
microprocessor, including: 

Functional units 

Signal descriptions 

External interface 

Internal processor registers (IPRs) 

Privileged architecture library code (PALcode) instructions 

Electrical characteristics 

Thermal characteristics 

Mechanical packaging 

This data sheet is not intended to provide the reader with everything needed to 
begin chip implementation. For a more comprehensive description of the 21164 
and the Alpha architecture, refer to documents listed in the Technical Support 
and Ordering Information section located at the end of this document. 

Document Conventions 

Throughout this data sheet, the following conventions are used: 

• INTn refers to NATURALLY ALIGNED groups of n 8-bit bytes. For 
example: 

- I NT16— The four least significant address bits are 0. 

- I NTS— The three least significant address bits are 0. 

- I NT4— The two least significant address bits are 0. 

• Values of 1, 0, and X are used in some tables. The X signifies a dont care 
(1 or 0) convention, which can be determined by the system designer. 
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2 Alpha 21164 Microprocessor Features 

• Fully pipelined 64-bit advanced RISC architecture supports multiple 
operating systems, including: 

- M i crosoft Wi ndows N T 

- OSF/1 

- OpenVMS 

266-MHz through 300-MHz operation 
Superscalar 4-way instruction issue 
High-bandwidth (128-bit) interface 
Peak execution rate of 1200 MIPS 
0.50-/im CMOS technology 
Three onchip caches: 

- 8K-byte, direct-mapped, LI instruction cache 

- 8K-byte, dual-ported, direct-mapped, write-through LI data cache 

- 96K-byte, 3-way, set-associative, write-back L2 data and instruction 
cache 

• Supports optional board-level LB cache ranging from IM byteto64M bytes 

The 21164 microprocessor implements IEEE S_floating and T_floating, and 
VAX F_floating and G_floating data types and supports longword (32-bit) and 
quadword (64-bit) integers. Provides byte (8-bit) and word (16-bit) support by 
byte-manipulation instructions. Limited hardware support is provided for the 
VAX D_floating data type. 
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3 Microarchitecture 

The Alpha 21164 Microprocessor is a high-performance implementation of 
Digital's Alpha architecture. The following sections provide an overview of the 
chip's architecture and major functional units. 

Figure 1 is a block diagram of the 21164. A larger version of this figure 
is printed on a foldout page at the end of the Alpha 21164 Microprocessor 
H ardware Reference M anual . 

The 21164 consists of the following sections (Figure 1): 

• I nstruction fetch/decode and branch unit (I box) 

• Integer execution unit (Ebox) 

• Memory address translation unit (Mbox) 

• Cache control and bus interface unit (Cbox) 

• Floating-point execution unit (Fbox) 

• Data cache (Dcache) 

• I nstruction cache (I cache) 

• Secondary cache (Scache) 

• Serial read-only memory (SROM) interface 
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Figure 1 Alpha 21164 Microprocessor Block/Pipe Flow Diagram 
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3.1 Instruction Fetch/Decode and Branch Unit 

The primary function of the instruction fetch/decode and branch unit (I box) 
is to manage and issue instructions to the Ebox, IMbox, and Fbox. It also 
manages the instruction cache. The I box contains: 

Prefetcher and instruction buffer 

I nstruction slot and issue logic 

Program counter (PC) and branch prediction logic 

48-entry instruction translation buffers (ITBs) 

Abort logic 

Register conflict logic 

Interrupt and exception logic 

3.1.1 Instruction Prefetch and Decode 

The I box handles only NATURALLY ALIGNED groups of four instructions 
(I NT16). The I box does not advance to a new group of four instructions until 
all instructions in a group are issued. If a branch to the middle of an INT16 
group occurs, then the I box attempts to issue the instructions from the branch 
target to the end of the current I NT16, then it proceeds to the next I NT16 of 
instructions after all the instructions in the target INT16 are issued. Thus, 
proper code scheduling is required to achieve optimal performance. 

3.1.2 Branch Prediction 

The branch unit, or prediction logic, is also part of the I box. Branch and PC 
prediction are necessary to predict and begin fetching the target instruction 
stream before the branch or jump instruction is issued. Each instruction 
location in the instruction cache (I cache) contains a 2-bit history state to record 
the outcome of branch instructions. 

3.1.3 Instruction Translation Buffer 

The I box includes a 48-entry, fully associative instruction translation buffer 
(ITB). The buffer stores recently used instruction stream (I stream) address 
translations and protection information for pages ranging from 8 to 512 
kilobytes and uses a not-last-used replacement algorithm. 

The 21164 provides two optional translation extensions called superpages. 
Access to superpages is allowed only while executing in privileged mode. 

• One superpage maps virtual address bits <39:13>to physical address bits 
<39:13>, on a one-to-one basis, when virtual address bits <42:41> equal 2. 
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• The other superpage maps virtual address bits <29:13>to physical address 
bits <29:13>, on a one-to-one basis, and forces physical address bits <39:30> 
to when virtual address bits <42:30> equal IFFE(hex). 

3.1.4 Interrupts 

The I box exception logic supports three sources of interrupts: 

• Hardware interrupts 

There are seven level-sensitive hardware interrupt sources supplied by the 
following signals: 

irq_h<3:0> 
sysmchchkirqh 
pwrfailirqh 
mchhaltirqh 

• Software interrupts 

There are 15 prioritized software interrupts sourced by an onchip internal 
processor register (I PR). 

• Asynchronous system traps 

There are four asynchronous system traps (ASTs) controlled by onchip 
IPRs. 

Most interrupts can be independently masked in onchip enable registers. 
In addition, AST interrupts are qualified by the current processor mode. 
All interrupts are disabled when the processor is executing PALcode. 

3.2 Integer Execution Unit 

The integer execution unit (Ebox) contains two 64-bit integer execution 
pipelines— EO and El, which include the following: 

Two adders 

Two logic boxes 

A barrel shifter 

Byte-manipulation logic 

An integer multiplier 

The Ebox also includes the 40-entry, 64-bit integer register file (IRF) that 
contains the 32 integer registers defined by the Alpha architecture and 8 
PALshadow registers. The register file has four read ports and two write ports, 
which provide operands to both integer execution pipelines and accept results 
from both pipes. The register file also accepts load instruction results (memory 
data) on the same two write ports. 

6 Preliminary — Subject to Cliange — July 1996 



3.3 Floating-Point Execution Unit 

Theonchip, pipelined floating-point unit (FPU) can execute both IEEE and 
VAX floating-point instructions. The 21164 supports IEEE S_floating and 
T_floating data types, and all rounding modes. It also supports VAX F_floating 
and G_floating data types, and provides limited support for the D_floating 
format. The FPU contains: 

• A 32-entry, 64-bit floating-point register file (FRF). 

• A user-accessible control register. 

• A floating-point multiply pipeline. 

• A floating-point add pipeline— The floating-point divide unit is associated 
with the floating-point add pipeline but is not pipelined. 

The FPU can accept two instructions every cycle, with the exception of floating- 
point divide instructions. The result latency for nondivide, floating-point 
instructions is four cycles. 

3.4 Memory Address Translation Unit 

The memory address translation unit (Mbox) contains three major sections: 

• Data translation buffer (dual ported) 
Miss address file (MAF) 

• Write buffer address file 

The M box receives up to two virtual addresses every cycle from the Ebox. The 
translation buffer generates the corresponding physical addresses and access 
control information for each virtual address. The 21164 implements a 43-bit 
virtual address and a 40-bit physical address. 

3.4.1 Data Translation Buffer 

The 64-entry, fully associative, dual-read-ported data translation buffer (DTB) 
stores recently used data stream (Dstream) page table entries (PTEs). Each 
entry supports all four granularity hint-bit combinations, so that a single DTB 
entry can provide translation for up to 512 contiguously mapped, 8K-byte 
pages. 

The DTB also supports the register-enabled superpage extension. The DTB 
superpage maps provide virtual-to-physical address translation for two regions 
of the virtual address space. 
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3.4.2 Miss Address File 

The Mbox begins the execution of each load instruction by translating the 
virtual address and by accessing the data cache (Dcache). Translation and 
Dcachetag read operations occur in parallel. If the addressed location is found 
in the Dcache (a hit), then the data from the Dcache is formatted and written 
to either the integer register file (IRF) or floating-point register file (FRF). The 
formatting required depends on the particular load instruction executed. If 
the data is not found in the Dcache (a miss), then the address, target register 
number, and formatting information are entered in the miss address file (M AF). 

The MAF performs a load-merging function. When a load miss occurs, each 
M AF entry is checked to see if it contains a load miss that addresses the same 
Dcache (32-byte) block. If it does, and certain merging rules are satisfied, then 
the new load miss is merged with an existing MAF entry. This allows the Mbox 
to service two or more load misses with one data fill from the Cbox. 

There are six MAF entries for load misses and four more for I box instruction 
fetches and prefetches. Load misses are usually the highest Mbox priority. 

3.4.3 Store Execution 

The Dcache follows a write-through protocol. During the execution of a store 
instruction, the M box probes the Dcache to determine whether the location to 
be overwritten is currently cached. If so (a Dcache hit), the Dcache is updated. 
Regardless of the Dcache state, the M box forwards the data to the Cbox. 

A load instruction that is issued one cycle after a store instruction in the 
pipeline creates a conflict if both the load and store operations access the 
same memory location. (The store instruction has not yet updated the location 
when the load instruction reads it.) This conflict is handled by forcing the load 
instruction to take a replay trap; that is, the I box flushes the pipeline and 
restarts execution from the load instruction. By the time the load instruction 
arrives at the Dcache the second time, the conflicting store instruction has 
written the Dcache and the load instruction is executed normally. 

Replay traps can be avoided by scheduling the load instruction to issue three 
cycles after the store instruction. If the load instruction is scheduled to issue 
two cycles after the store instruction, then it will be issue-stalled for one cycle. 

3.4.4 Write Buffer 

The Mbox also contains a write buffer that has six 32-byte entries. The write 
buffer provides a finite, high-bandwidth resource for receiving store data to 
minimize the number of CPU stall cycles. 
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3.5 Cache Control and Bus Interface Unit 

The cache control and bus interface unit (Cbox) processes all accesses sent by 
the Mbox and implements all memory-related external interface functions, 
particularly the coherence protocol functions for write-back caching. It 
controls the second-level cache (Scache) and the optional board-level backup 
cache (Bcache). The Cbox handles all instruction and primary Dcache read 
misses, performs the function of writing data from the write buffer into the 
shared coherent memory subsystem, and has a major role in executing the 
Alpha memory barrier (MB) instruction. The Cbox also controls the 128-bit 
bidirectional data bus, address bus, and I/O control. 

3.6 Cache Organization 

The 21164 has three onchip caches— a primary LI data cache, a primary LI 
instruction cache, and a second-level L2 combined data and instruction cache. 
All memory cells in the onchip caches are fully static, 6-transistor, CMOS 
structures. 

The 21164 also provides control for an optional board-level, external L3 cache. 

3.6.1 Data Cache 

The data cache (Dcache) is a dual-read-ported, single-write-ported, 8K-byte 
cache. It is a write-through, read-allocate, direct-mapped, physical cache with 
32-byte blocks. 

3.6.2 Instruction Cache 

The instruction cache (I cache) is an 8K-byte, virtual, direct-mapped cache with 
32-byte blocks. Each block tag contains: 

• A 7-bit address space number (ASN) field as defined by the Alpha 
architecture 

• A 1-bit address space match (ASM) field as defined by the Alpha 
architecture 

• A 1-bit PALcode (physically addressed) indicator 

Software, rather than I cache hardware, maintains I cache coherence with 
memory. 
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3.6.3 Second-Level Cache 

The second-level cache (Scache) is a 96K-byte, 3-way, set-associative, physical, 
write-back, write-allocate cache with 32- or 64-byte blocks. It is a mixed data 
and instruction cache. The Scache is fully pipelined; it processes read and 
write operations at the rate of one I NT16 per CPU cycle and can alternate 
between read and write accesses without bubble cycles. 

When operating in 32-byte block mode, the Scache has 64-byte blocks with 
32-byte subblocks, one tag per block. If configured to 32 bytes, the Scache is 
organized as three sets of 512 blocks, with each block divided into two 32-byte 
subblocks. If configured to 64 bytes, the Scache is three sets of 512 64-byte 
blocks. 

3.6.4 External Cache 

TheCbox implements control for an optional, external, direct-mapped, physical, 
write-back, write-allocate cache with 32- or 64-byte blocks. The 21164 supports 
board-level cache sizes of 1, 2, 4, 8, 16, 32, and 64 megabytes. 

3.7 Serial Read-Only Memory Interface 

The serial read-only memory (SROM) interface provides the initialization 
data load path from a system SROM to the instruction cache. Following 
initialization, this interface can function as a diagnostic port by using 
privileged architecture library code (PALcode). 

3.8 Pipeline Organization 

The 21164 has a 7-stage (or 7-cycle) pipeline for integer operate and memory 
reference instructions, and a 9-stage pipeline for floating-point operate 
instructions. The I box maintains state for all pipeline stages to track 
outstanding register write operations. 

Figure 2 shows the integer operate, memory reference, and floating-point 
operate pipelines for the I box, FPU, Ebox, and M box. The first four stages are 
executed in the I box. Remaining stages are executed by the E box, Fbox, M box, 
and Cbox. 
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Figure 2 Instruction Pipeline Stages 
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4 Pinout and Signal Descriptions 

Sections 4.1 and 4.2 list and describe tine 21164 microprocessor external 
signals, and their associated pins. 

4.1 Pin Assignment 

The 21164 package has 499 pins aligned in an interstitial pin grid array 
(I PGA) design. Table 1 lists the 21164 signal pins and their corresponding pin 
grid array (PGA) locations in alphabetic order. There are 292 functional signal 
pins, 2 spare (unused) signal pins, 104 power (Vdd) pins, and 101 ground (Vss) 
pins. 

Table 1 Alphabetic Signal Pin List 





PGA 




PGA 




PGA 


Signal 


Location 


Signal 


Location 


Signal 


Location 


addr_bus_req_h 


E23 


addr_cmd_par_h 


B20 


addr_h<4> 


BB14 


addr_h«5> 


BC13 


addr_h<a6> 


BA13 


addr_h<i7> 


AV14 


addr_h-^> 


AW13 


addr_h-^> 


BCll 


addr_h<lO> 


BAll 


addr_h<ll> 


AV12 


addr_h<12> 


AW 11 


addr_h<L3> 


BC09 


addr_h<a4> 


BA09 


addr_h<15> 


AVIO 


addr_h<l6> 


AW09 


addr_h<l7> 


BC07 


addr_h<18> 


BA07 


addr_h<l9> 


AV08 


addr_h<20> 


AW07 


addr_h<21> 


BC05 


addr_h<22> 


BC39 


addr_h<23> 


AW37 


addr_h<24> 


AV36 


addr_h<25> 


BA37 


addr_h«26> 


BC37 


addr_h<27> 


AW35 


addr_h<28> 


AV34 


addr_h<29> 


BA35 


addr_h<30> 


BC35 


addr_h<31> 


AW33 


addr_h<32> 


AV32 


addr_h<33> 


BA33 


addr_h<34> 


BC33 


addr_h<35> 


AW31 


addr_h<36> 


AV30 


addr_h<37> 


BA31 


addr_h<38> 


BC31 


addr_h<39> 


BB30 


addr_res_h<0> 


C27 


addr_res_h<l> 


F26 


addr_res_h-^> 


E27 


cack_h 


G21 


cfail_h 


C25 


clk_mode_hO> 


AU21 


c 1 k_mode_h '^> 


BA23 


cmd_h<0> 


F20 


cmd_h<l> 


A 19 


cmd_h<^> 


C19 


cmd_h<3> 


E19 


cpu_clk_out_h 


BA25 


dackh 


B24 


databusreqh 


E25 


data_check_hO> 


J 41 


data_check_h<l> 


K38 


data_check_h«3> 


J 39 


data_check_h«3> 


G43 


datachec k_h <3l> 

(continued 


G41 
on next page) 
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Table 1 (Cont.) Alphabetic Signal Pin List 





PGA 




PGA 




PGA 


Signal 


Location 


Signal 


Location 


Signal 


Location 


data_check_h"^> 


H38 


data_check_h<6> 


G39 


data_check_h<i7> 


E43 


data_check_h-^> 


J 03 


data_check_h<9> 


K06 


data_check_h<10> 


J 05 


data_check_h<U> 


GOl 


data_check_h<12> 


G03 


data_check_h<13> 


H06 


data_check_h<14> 


G05 


data_check_h«^5> 


EOl 


data_hO> 


J 43 


data_h«^> 


L39 


data_h<2> 


M38 


data_h<3> 


L41 


data_h<4> 


L43 


data_h<5> 


N39 


data_h<*> 


P38 


data_h<i7> 


N41 


data_h<fi> 


N43 


data_h<^> 


P42 


data_h<^0> 


R39 


data_h<^l> 


T38 


data_h<12> 


R41 


data_h<13> 


R43 


data_h<l4> 


U39 


data_h<l5> 


V38 


data_h<16> 


U41 


data_h<17> 


U43 


data_h<l8> 


W39 


data_h<l9> 


W41 


data_h<20> 


W43 


data_h<21> 


Y38 


data_h<22> 


Y42 


data_h<23> 


AA39 


data h<74> 


AA41 


data_h<25> 


AA43 


data_h<^6> 


AB38 


data_h<27> 


AC43 


data_h<28> 


AC41 


data_h<29> 


AC39 


data_h<30> 


AD42 


data_h<31> 


AD38 


data_h<32> 


AE43 


data_h<33> 


AE41 


data_h<34> 


AE39 


data_h<35> 


AG43 


data_h<36> 


AG41 


data_h<37> 


AF38 


data_h<38> 


AG39 


data_h<39> 


AJ43 


data_h<aO> 


AJ41 


data_h<41> 


AH 38 


data_h<42> 


AJ39 


data_h<43> 


AK42 


data_h<J4> 


AL43 


data_h<45> 


AL41 


data_h<46> 


AK38 


data_h<47> 


AL39 


data_h<48> 


AN 43 


data_h<49> 


AN41 


data_h<50> 


AM38 


data_h<51> 


AN39 


data_h<^2> 


AR43 


data_h<53> 


AR41 


data_h<54> 


AP38 


data_h<55> 


AR39 


data_h<56> 


AU43 


data_h<57> 


AU41 


data_h<58> 


AT38 


data_h<59> 


AU39 


data_h<60> 


AW43 


data_h<61> 


AW41 


data_h<a62> 


AV38 


data h<63> 


AW39 


data h<fi4> 


J 01 


data_h<*5> 


105 


data_h<*6> 


M06 


data_h<67> 


L03 


data_h<fi8> 


LOl 


data_h<*9> 


N05 


data_h<i70> 


P06 


data_h<i71> 


N03 


data_h<i72> 

(continued 


NOl 

on next page) 
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Table 1 (Cont.) 


Alphabetic 


Signal Pin List 










PGA 




PGA 




PGA 


Signal 


Location 


Signal 


Location 


Signal 


Location 


data_h"i73> 


P02 


data_h^4> 


R05 


data_h^5> 


T06 


data_h^6> 


ROB 


data_h^7> 


ROl 


data_h^8> 


U05 


data_h-i79> 


V06 


data_h<80> 


U03 


data_h<81> 


UOl 


data_h<^2> 


W05 


data h<83> 


W03 


data h<84> 


WOl 


data_h<^5> 


Y06 


data_h<^6> 


Y02 


data_h<^7> 


AA05 


data_h<^8> 


AA03 


data_h<^9> 


AAOl 


data_h<^0> 


AB06 


data_h<^l> 


ACOl 


data_h<^2> 


AC03 


data_h<^3> 


AC05 


data_h<94> 


AD02 


data_h<^5> 


AD06 


data_h<^6> 


AEOl 


data_h<97> 


AE03 


data_h<98> 


AE05 


data_h<99> 


AGOl 


data_h<lOO> 


AG03 


data_h<l01> 


AF06 


data_h<102> 


AG05 


data_h<l03> 


AJOl 


data_h<l04> 


AJ03 


data_h<l05> 


AH 06 


data_h<106> 


AJ05 


data_h<107> 


AK02 


data_h<a.08> 


ALOl 


data_h<109> 


AL03 


data_h<110> 


AK06 


data h<111> 


AL05 


data h^1?> 


ANOl 


data_h<113> 


AN 03 


data_h<Ll4> 


AM 06 


data h^1'>> 


AN 05 


data_h<116> 


AROl 


data_h<Ll7> 


AR03 


data h<n8> 


AP06 


data h<n9> 


AR05 


data_h<120> 


AUOl 


data_h<l21> 


AU03 


data_h<l22> 


AT 06 


data h<173> 


AU05 


data_h<l24> 


AWOl 


data_h<l25> 


AW03 


data_h<126> 


AV06 


data_h<127> 


AW05 


data_ram_oe_h 


F22 


data_ram_we_h 


A23 


dcokh 


AU23 


fill_error_h 


A25 


fill_h 


G23 


fill_id_h 


F24 


fill_nocheck_h 


G25 


idlebch 


A27 


index_h<4> 


A29 


index_h'^> 


C29 


index_h<6> 


F28 


index_h<i7> 


E29 


index_h-^> 


B30 


index_h-^> 


A31 


index_h<10> 


C31 


index_h-^l> 


F30 


index_h<12> 


E31 


index_h<13> 


A33 


index_h-^4> 


C33 


index_h<15> 


F32 


index_h<16> 


E33 


index_h«^7> 


A35 


index_h<18> 


C35 


index_h<19> 


F34 


index_h<20> 


E35 


index_h<^l> 


A37 


index_h«^2> 


C37 


index_h«23> 


F36 


index_h<^4> 

(continued 


E37 

on next page) 
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Table 1 (Cont.) Alphabetic Signal Pin List 





PGA 




PGA 




PGA 


Signal 


Location 


Signal 


Location 


Signal 


Location 


index_h-^5> 


A39 


int4_valid_h<0> 


F38 


int4_valid_h<l> 


E41 


int4_valid_h<2> 


F06 


int4_valid_h<3> 


E03 


irq^h<0> 


BA29 


irq^h<l> 


AU27 


irq_h-^> 


BC29 


irq^h<3> 


AW27 


mchhitjrqh 


AU25 


oscclkjnh 


BC21 


oscclkjnj 


BB22 


perf_mon_h 


AW29 


port_mode_h <0> 


AY 20 


port_mode_h <1> 


BB20 


pwr_fail_irq_h 


AV26 


ref_clk_in_h 


AW25 


scache_set_hO> 


C17 


scache_set_h<l> 


A 17 


shared_h 


C23 


srom_clk_h 


BA19 


srom_data_h 


BC19 


srom_oe_l 


AW 19 


srom_present_l 


AV20 


stcikh 


EOS 


systemjock flagh 


G27 


sysclkoutlh 


AW23 


sysclkoutlj 


BB24 


sys_clk_out2_h 


AV24 


sys_clk_out2_l 


BC25 


sysmchchkjrqh 


BA27 


sysresetj 


BC27 


tagctlparh 


F18 


tag_data_h<^0> 


A05 


tag_data_h<21> 


E07 


tag_data_h<^2> 


FOB 


tag_data_h<23> 


C07 


tag_data_h<24> 


A07 


tag_data_h<^5> 


E09 


tag_data_h<^6> 


FIO 


tag_data_h<27> 


C09 


tag_data_h<28> 


A09 


tag_data_h<^9> 


Ell 


tag_data_h<30> 


F12 


tag_data_h<31> 


Cll 


tag_data_h<32> 


All 


tag_data_h<33> 


E13 


tag_data_h<34> 


F14 


tag_data_h<35> 


C13 


tag_data_h<36> 


A13 


tag_data_h<37> 


B14 


tag_data_h<38> 


E15 


tagdataparh 


C15 


tag_dirty_h 


E17 


tag_ram_oe_h 


C21 


tag_ram_we_h 


A21 


tag_shared_h 


A15 


tag_valid_h 


F16 


tckh 


AW17 


tdih 


BC17 


tdo_h 


BA17 


tempsense 


AW15 


test status hO> 


BA15 


test_status_h<l> 


AV16 


tms_h 


AV18 


trst_l 


BC15 


victi mpendi ngh 


E21 


spare_in<438> 


E39 


spare_io-^50> 

(continued 


AV28 

on next page) 
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Table 1 (Cont.) Alphabetic Signal Pin List 



Signal PGA Location 



Vss— Metal planes A03, A41, AA07, AA37, AC07, AC37, AD04, AD40, AF02, AF42, AG07, AG37, AH04, 

2i and 5^ AH 40, AL07, AL37, AM 04, AM 40, AP02, AP42, AR07, AR37, AT04, AT 40, AU09, 

AU13, AU17, AU31, AU35, AV02, AV22, AV42, AW21, AY04, AY08, AY12, AY16, 
AY22, AY24, AY28, AY32, AY36, AY 40, B02, B06, BIO, B18, B26, B34, B38, B42, 
BAOl, BA21, BA43, BB02, BB06, BBIO, BB18, BB26, BB34, BB38, BB42, BC03, 
BC41, COl, C43, D04, D08, D12, D16, D20, D24, D28, D32, D36, D40, F02, F42, G09, 
G13, G17, G31, G35, H04, H40, J 07, J 37, K02, K42, M04, M40, N07, N37, T04, T40, 
U07, U37, V02, V42, Y04, Y40 

Vdd AB02, AB04, AB40, AB42, AE07, AE37, AF04, AF40, AH02, AH42, AJ 07, AJ 37, 

Metal planes 4 and 6 AK04, AK40, AM02, AM42, AN07, AN37, AP04, AP40, AT02, AT42, AU07, AUll, 
AU15, AU19, AU29, AU33, AU37, AV04, AV40, AY02, AY06, AYIO, AY14, AY18, 
AY26, AY30, AY34, AY38, AY 42, B04, B08, B12, B16, B22, B28, B32, B36, B40, BA03, 
BA05, BA39, BA41, BB04, BB08, BB12, BB16, BB28, BB32, BB36, BB40, BC23, C03, 
COS, C39, C41, D02, D06, DIO, D14, D18, D22, D26, D30, D34, D38, D42, F04, F40, 
Gil, G15, G19, G29, G33, G37, H02, H42, K04, K40, L07, L37, M02, M42, P04, P40, 
R07, R37, T02, T42, V04, V40, W07, W37 



■"^Metal plane 2— Seal ring connection tied to Vss 

^Metal plane 5— Heat slug braze pad connections tied to Vss 
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4.2 Alpha 21164 Packaging 

Figure 3 shows the 21164 pinout from the top view with pins facing down. 

Figure 3 Alpha 21164 Top View (Pin Down) 
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Figure 4 shows the 21164 pinout from the bottom view with pins facing up. 



Figure 4 Alpha 21164 Bottom View (Pin Up) 
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LJ-03413-TIOB 



4.3 Alpha 21164 Microprocessor Logic Symbol 

Figure 5 shows the logic symbol for the 21164 chip. 
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Figure 5 Alpha 21164 Microprocessor Logic Symbol 
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4.4 Alpha 21164 Signal Names and Functions 

The following table defines the 21164 signal types referred to in this section: 



Signal Type 



Definition 



B 



Bidirectional 
Input only 
Output only 



The remaining two tables describe the function of each 21164 external signal. 
Table 2 lists all signals in alphanumeric order. This table provides full signal 
descriptions. Table 3 lists signals by function and provides an abbreviated 
description. 



Table 2 Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



addr_h<39:4> B 

addrbusreqh I 

addrcmdparh B 

addr res h<l:0> O 



36 Address bus. These bidirectional signals provide the 

address of the requested data or operation between the 
21164 and the system. If bit 39 is asserted, then the 
reference is to noncached, I/O memory space. 

1 Address bus request. The system interface uses 

this signal to gain control of the addr_h<39:4>, 
addrcmdparh, and cmd_h<3:0>pins. 

1 Address command parity This is the odd parity bit on 
the current command and address buses. The 21164 
takes a machine check if a parity error is detected. The 
system should do the same if it detects an error. 

2 Address response bits <1> and <0>. For system 
commands, the 21164 uses these pins to indicate the 
state of the block in the Scache: 



Bits 


Command 


Meaning 


00 


NOP 


Nothing. 


01 


NOACK 


Data not found or clean. 


10 


ACK/Scache 


Data from Scache. 


11 


AC K/B cache 


Data from Bcache. 



(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



addr res h<2> 



cack h 



cfail h 



elk mode h<l:0> 



cmd h<3:0> 



1 Address response bit <2>. For system commands, the 

21164 uses this pin to indicate if the command hits in 
the Scache or onchip load locl< register. 

1 1 Command acl<nowledge. The system interface uses this 

signal to acknowledge any one of the commands driven 
by the 21164. 

I 1 Command fail. This signal has two uses. It can be 

asserted during a cack cycle of a WRITE BLOCK LOCK 
command to indicate that the write operation is not 
successful. In this case, both cack_h and cfail_h are 
asserted together. It can also be asserted instead of 
cack_h to force an instruction fetch/decode unit (I box) 
timeout event. This causes the 21164 to do a partial 
reset and trap to the machine check (MCHK) PALcode 
entry point, which indicates a serious hardware error. 

I 2 Clock test mode. These signals specify a relationship 

between osc_clk_in_h,l and the CPU cycle time. These 
signals should bedeasserted in normal operation mode. 

B 4 Command bus. These signals drive and receive the 

commands from the command bus. The following 
tables define the commands that can be driven on the 
cmd_h<3:0> bus by the 21164 or the system. 

(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



21164 Commands to System: 



cmd h 
<3:0> 


Command 


Meaning 


0000 


NOP 


Nothing. 


0001 


LOCK 


Lock register address. 


0010 


FETCH 


Tlie 21164 passes a 
FETCH instruction to 
tlie system. 


0011 


FETCH_M 


Tlie 21164 passes a 
FETCH_M instruction 
to tlie system. 


0100 


MEMORY 
BARRIER 


MB instruction. 


0101 


SET DIRTY 


Dirty bit set if sliared 
bit is clear. 


Olio 


WRITE BLOCK 


Request to write a 
blocl<. 


cm 


WRITE BLOCK 
LOCK 


Request to write a 
blocl< with locl<. 


1000 


READ MISSO 


Request for data. 


1001 


READ MISSl 


Request for data. 


1010 


READ MISS MODO 


Request for data; 
modify intent. 


1011 


READ MISS MODI 


Request for data; 
modify intent. 


1100 


BCACHE VICTIM 


Bcache victim should 
be removed. 


1101 


— 


Reserved. 


1110 


READ MISS MOD 
STCO 


Request for data, 
STx_C data. 


1111 


READ MISS MOD 
STCl 


Request for data, 
STx_C data. 



(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



System Commands to 21164: 



cmd_h 
<3:0> 



0111 



Command 



0000 


NOP 


0001 


FLUSH 


0010 


INVALIDATE 


0011 


SET SHARED 


0100 


READ 


0101 


READ DIRTY 



Meaning 



READ DIRTY/I NV 



Nothing. 

Remove block from 
caches; return dirty 
data. 

Invalidate the block 
from caches. 

Block goes to the 
shared state. 

Read a block. 

Read a block; set 
shared. 

Read a block; 
invalidate. 



cpu_clk_out_h 
dackh 

data_h<127:0> 
data_bus_req_h 



O 



data check h<15:0> B 



1 CPU clock output. This signal is used for test purposes. 

1 Data acknowledge. The system interface uses this 

signal to control data transfer between the 21164 and 
the system. 

128 Data bus. These signals are used to move data between 
the 21164, the system, and the Bcache. 

1 Data bus request. If the 21164 samples this signal 

asserted on the rising edge of sysclk n, then the 21164 
does not drive the data bus on the rising edge of sysclk 
n+1. Before asserting this signal, the system should 
assert idle_bc_h for the correct number of cycles. If the 
21164 samples this signal deasserted on the rising edge 
of sysclk n, then the 21164 drives the data bus on the 
rising edge of sysclk n+1. 

16 Data check. These signals set even byte parity or I NTS 
ECC for the current data cycle. 

(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



data_ram_oe_h 





1 


data_ram_we_h 





1 


dc ok h 


1 


1 



fill h 



fill error h 



fill id h 



fill nocheck h 



idle be h 



index h<25:4> 



22 



Data RAM output enable. This signal is asserted for 
Bcache read operations. 

Data RAM write-enable. This signal is asserted for any 
Bcache write operation. 

dc voltage OK. Must be deasserted until dc voltage 
reaches proper operating level. After that, dc_ok_h is 
asserted. 

Fill warning. If the 21164 samples this signal asserted 
on the rising edge of sysclk n, then the 21164 provides 
the address indicated by fiii_id_h to the Bcache on the 
rising edge of sysclk n+1. The Bcache begins to write in 
that sysclk. At the end of sysclk n+1, the 21164 waits 
for the next sysclk and then begins the write operation 
again if dack_h is not asserted. 

Fill error. If this signal is asserted during a fill from 
memory, it indicates to the 21164 that the system has 
detected an invalid address or hard error. The system 
still provides an apparently normal read sequence with 
correct ECC/parity though the data is not valid. The 
21164 traps to the machine check (MCHK) PALcode 
entry point and indicates a serious hardware error. 
fiii_error_h should be asserted when the data is 
returned. Each assertion produces a MCHK trap. 

Fill identification. Asserted with fiii_h to indicate which 
register is used. The 21164 supports two outstanding 
load instructions. If this signal is asserted when the 
21164 samples fiii_h asserted, then the 21164 provides 
the address from miss register 1. If it is deasserted, 
then the address in miss register is used for the read 
operation. 

Fill checking off. If this signal is asserted, then the 
21164 does not check the parity or ECC for the current 
data cycle on a fill. 

Idle Bcache. When asserted, the 21164 finishes the 
current Bcache read or write operation but does not 
start a new read or write operation until the signal 
is deasserted. The system interface must assert this 
signal in time to idle the Bcache before fill data arrives. 

Index. These signals index the Bcache. 

(continued on next page) 
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Table 2 (Cont.) 


Alpha 21164 Signal Descriptions 


Signal 


Type Count Description 



int4_valid_h<3:0> O 4 I NT4 data valid. During write operations to noncached 

space, these signals are used to indicate which I NT4 
bytes of data are valid. This is useful for noncached 
write operations that have been merged in the write 
buffer. 



int4_valid_h<3:0> Write Meaning 

xxxl data_h<31:0>valid 

xxlx data_h<]63:32> valid 

xlxx data_h<95:64> valid 

Ixxx data_h<127:96>valid 

During read operations to noncached space, these 
signals indicate which I NTS bytes of a 32-byte block 
need to be read and returned to the processor. This is 
useful for read operations to noncached memory. 

int4_valid_li<3:0> Read IVIeaning 

xxxl data_h<63:0>valid 

xxlx data_h<127:64>valid 

xlxx data_h<L91:128> val id 

Ixxx data_h<255:192> valid 

Note: For both read and write operations, multiple 
int4_valid_h<3:0>bits can beset simultaneously. 

(continued on next page) 



Preliminary — Subject to Ciiange — Juiy 1996 25 



Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



irq_h<3:0> 



System interrupt requests. These signals have multiple 
modes of operation. During normal operation, these 
level -sensitive signals are used to signal interrupt 
requests. During initialization, these signals are used to 
set up the CPU cycle time divisor for sys_clk_outl_h,l 
as follows: 







irq_hi 






<3> 


<2> 


<1> 


<0> 


Ratio 


Low 


Low 


High 


High 


3 


Low 


High 


Low 


Low 


4 


Low 


High 


Low 


High 


5 


Low 


High 


High 


Low 


6 


Low 


High 


High 


High 


7 


High 


Low 


Low 


Low 


8 


High 


Low 


Low 


High 


9 


High 


Low 


High 


Low 


10 


High 


Low 


High 


High 


11 


High 


High 


Low 


Low 


12 


High 


High 


Low 


High 


13 


High 


High 


High 


Low 


14 


High 


High 


High 


High 


15 



mchhltjrqh 



oscclkjnh 
osc elk in I 



1 Machine halt interrupt request. This signal has 

multiple modes of operation. During initialization, 
this signal is used to set up sys_clk_out2_h,l delay. 
During normal operation, it is used to signal a halt 
request. 

1 Oscillator clock inputs. These signals provide the 

1 differential clock input that is the fundamental timing 

of the 21164. These signals are driven at twice the 
desired internal clock frequency. (Under normal 
operating conditions the CPU cycle time is one-half 
the frequency of osc_clk_in.) 

(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



perfmonh 



port_mode_h<l:0> I 2 



pwr_fail_irq_h 



ref elk in h 



I 1 



I 1 



scache set h<l:0> O 2 



shared h 



sromcikh 





1 


sromdatah 


1 


1 


sromoej 





1 


srom present 1^ 


B 


1 



Performance monitor. This signal can be used as an 
input to tlie 21164 internal performance monitoring 
hardware from offchip events (such as bus activity). 

Select test port interface modes (normal, manufacturing, 
and debug). For normal operation, both signals must be 
deasserted. 

Power failure interrupt request. This signal has 
multiple modes of operation. During initialization, 
this signal is used to set up sys_clk_out2_h,l delay. 
During normal operation, this signal is used to signal a 
power failure. 

Reference clock input. Optional. Used to synchronize 
the timing of multiple microprocessors to a single 
reference clock. If this signal is not used, it must be tied 
to Vdd for proper operation. 

Secondary cache set. During a read miss request, these 
signals indicate the Scache set number that will be 
filled when the data is returned. This information can 
be used by the system to maintain a duplicate copy of 
the Scache tag store. 

Keep block status shared. For systems without a 
Bcache, when a WRITE BLOCK/NO VICTIM PENDING 
or WRITE BLOCK LOCK command is acknowledged, 
this pin can be used to keep the block status shared or 
private in the Scache. 

Serial ROM clock. Supplies the clock that causes the 
SROM to advance to the next bit. Thecyde time of this 
clock is 128 times the cycle time of the CPU clock. 

Serial ROM data. Input for the SROM. 

Serial ROM output enable. Supplies the output enable 
to the SROM. 

Serial ROM present. Indicates that SROM is present 
and ready to load the I cache. 



■"^This signal is shown as bidirectional. However, for normal operation it is input only. The output function is 
used during manufacturing test and verification only. 



(continued on next page) 
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Table 2 (Cont.) Alpha 21164 Signal Descriptions 



Signal 



Type Count Description 



St elk h 



sys_clk_outl_h 
sysclkoutlj 

sys_clk_out2_h 
sys_clk_out2_l 

sys_mch_chk_i rqh 



sys_reset_l 



systemi oc kflagh 



tag_ctl_par_h 



O 






1 





1 





1 





1 



tagdatah <38:20> 


B 


19 


tag_data_par_h 


B 


1 


tagdirtyh 


B 


1 


tag_ram_oe_h 





1 



STRAM clock. Clock for Bcache synchronously 
timed RAMs (STRAMs). This signal is synchronous 
with index_h<25:4> during private read and write 
operations, and with sys_clk_outl_h,l during read and 
fill operations. 

System clock outputs. Programmable system clock 
(cpu_clk_out_h divided by a value of 3 to 15) is used 
for board-level cache and system logic. 

System clock outputs. A version of sys_clk_outl_h,l 
delayed by a programmable amount from to 7 CPU 
cycles. 

System machine check interrupt request. This signal 
has multiple modes of operation. During initialization, 
it is used to set up sys_clk_out2_h,l delay. During 
normal operation, it is used to signal a machine 
interrupt check request. 

System reset. This signal protects the 21164 from 
damage during initial power-up. It must be asserted 
until dc_ok_h is asserted. After that, it is deasserted 
and the 21164 begins its reset sequence. 

System lock flag. During fills, the 21164 logically AN Ds 
the value of the system copy with its own copy to 
produce the true value of the lock flag. 

Tag control parity. This signal indicates odd parity for 
tag_valid_h, tag_shared_h, and tag_dirty_h. During 
fills, the system should drive the correct parity based on 
the state of the valid, shared, and dirty bits. 

Bcache tag data bits. This bit range supports IM-byte 
to 64M-byte Beaches. 

Tag data parity bit. This signal indicates odd parity for 
tagdatah <38:20>. 

Tag dirty state bit. During fills, the system should 
assert this signal if the 21164 request is a READ M I SS 
MOD, and the shared bit is not asserted. 

Tag RAM output enable. This signal is asserted during 
any Bcache read operation. 



(continued on next page) 
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Table 2 (Cont.) 


Alpha 21164 Signal Descriptions 


Signal 


Type Count Description 



tagramweh 



O 



tag_shared_h 


tag_valid_h 


tck_h 


tdi_h 


tdo_h 


tempsense 



test status h<l:0> 



O 



tmsh 


1 


1 


trstji 


B 


1 


victi mpendi ngh 





1 



Tag RAM write-enable. This signal is asserted 
during any tag write operation. During the first 
CPU cycle of a write operation, the write pulse is 
deasserted. In the second and following CPU cycles 
of a write operation, the write pulse is asserted if the 
corresponding bit in the write pulse register is asserted. 
Bits BC_WE_CTL<8:0> control the shape of the pulse. 

Tag shared bit. During fills, the system should drive 
this signal with the correct value to mark the cache 
block as shared. 

Tag valid bit. During fills, this signal is asserted to 
indicate that the block has valid data. 

J TAG boundary scan clock. 

J TAG serial boundary scan data-in signal. 

J TAG serial boundary scan data-out signal. 

Temperature sense. This signal is used to measure the 
die temperature and is for manufacturing use only. For 
normal operation, this signal must be left disconnected. 

I cache test status. These signals are used for 
manufacturing test purposes only to extract I cache test 
status information from the chip. test_status_h<0> 
is asserted if ICSR<39> is true, on I box timeout, or 
remains asserted if the I cache built-in self-test (BiSt) 
fails. Also, test_status_h<0> outputs the value written 
by PALcode to test_status_h<l> through I PR access. 

J TAG test mode select signal. 

J TAG test access port (TAP) reset signal. 

Victim pending. When asserted, this signal indicates 
that the current read miss has generated a victim. 



^This signal is shown as bidirectional. However, for normal operation it is input only. The output function is 
used during manufacturing test and verification only. 



Preliminary— Subject to Ciiange—July 1996 29 



Table 3 lists signals by function and provides an abbreviated description. 



Table 3 Alpha 21164 Signal Descriptions by Function 



Signal 


Type 


Count 


Description 


Clocks 


clk_mode_h<a:0> 


1 


2 


Clock test mode. 


cpuclkouth 





1 


CPU clock output. 


oscclkjnh.l 


1 


2 


Oscillator clock inputs. 


refclkjnh 


1 


1 


Reference clock input. 


st_clk_h 





1 


Bcache 5TRAM clock output. 


sys_clk_outl_h,l 





2 


System clock outputs. 


sys_clk_out2_h,l 





2 


System clock outputs. 


sys_reset_l 


1 


1 


System reset. 


Bcache 



data_h<127:0> 


B 


128 


data_check_h<15:0> 


B 


16 


dataramoeh 







data_ram_we_h 







index_h<25:4> 





22 


tag_ctl_par_h 


B 




tag_data_h <38:20> 


B 


19 


tagdataparh 


B 




tag_dirty_h 


B 




tag_ram_oe_h 







tag_ram_we_h 







tag_shared_h 


B 




tag_valid_h 


B 





Data bus. 

Data check. 

Data RAM output enable. 

Data RAM write-enable. 

I ndex. 

Tag control parity. 

Bcache tag data bits. 

Tag data parity bit. 

Tag dirty state bit. 

Tag RAM output enable. 

Tag RAM write-enable. 

Tag shared bit. 

Tag valid bit. 

(continued on next page) 
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Table 3 (Cont.) Alpha 21164 Signal Descriptions by Function 



Signal 



Type Count Description 



System Interface 



addr_h<39:4> 

addr_bus_req_h 

addrcmdparh 

addr_res_h<2:0> 

cackh 

cfailh 

cmd_h<3:0> 

dackh 

data_bus_req_h 

fill_h 

fillerrorh 

filljd_h 

fillnocheckh 

idle_bc_h 

int4_valid_h<3:0> 

scacheseth <1:0> 

shared_h 

system_lock_flag_h 

victimpendingh 



B 
O 



O 
O 



O 



36 Address bus. 

1 Address bus request. 

1 Address command parity. 

3 Address response. 

1 Command acknowledge. 

1 Command fail. 

4 Command bus. 

1 Data acknowledge. 

1 Data bus request. 

1 Fill warning. 

1 Fill error. 

1 Fill identification. 

1 Fill checking off. 

1 IdleBcache. 

4 1NT4 data valid. 

2 Secondary cache set. 

1 Keep block status shared. 

1 System lock flag. 

1 Victim pending. 



Interrupts 

irq_h<3:0> 
mchhltjrqh 
pwr_fail_irq_h 
sys_mch_chk_i rqh 



4 System interrupt requests. 

1 Machine halt interrupt request. 

1 Power failure interrupt request. 

1 System machine check interrupt request. 

(continued on next page) 
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Table 3 (Cont.) Alpha 21164 Signal Descriptions by Function 



Signal 



Type Count Description 



Test Modes and Miscellaneous 



dc_ok_h 
perfmonh 
port_mode_h <1:0> 

sromcikh 

sromdatah 

srom_oe_l 

sr ompr esenti ^ 

tck_h 

tdi_h 

tdo_h 

tempsense 

teststatush <1:0> 

tms_h 

trst |i 



1 


1 


dc voltage OK. 


1 


1 


Performance monitor. 


1 


2 


Select test port interface modes (normal 
manufacturing, and debug). 





1 


Serial ROM clock. 


1 


1 


Serial ROM data. 





1 


Serial ROM output enable. 


B 


1 


Serial ROM present. 


B 


1 


J TAG boundary scan clock. 


1 


1 


J TAG serial boundary scan data in. 





1 


J TAG serial boundary scan data out. 


1 


1 


Temperature sense. 





2 


1 cache test status. 


1 


1 


J TAG test mode select. 


B 


1 


J TAG test access port (TAP) reset. 



^This signal is sliown as bidirectional, 
function is used during manufacturing 



However, for normal operation it is input only. Tlie output 
test and verification only. 
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5 Alpha 21164 Microprocessor Functional Overview 

This section provides an overview of 21164 external signals that support the 
following: 

Clocks 

B cache interface 

System interface 

Interrupts 

Test modes 

See Figure 1 for a block diagram of the 21164. 
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5.1 Clocks 

The 21164 accepts two clock signal inputs and develops three clock signal 
outputs: 



Signal 



Description 



Input Clocl< Signals 



osc_clk_in_h,l Differential inputs normally driven at two times the desired 

internal frequency. 

ref_clk_in_h A system-supplied clock to which the 21164 synchronizes its timing 

for multiprocessor systems. 



Output Clock Signals 



cpuclkouth 
sys_cl koutlh.l 
sys_cl k_out2_h,l 



A 21164 internal clock that may or may not drive the system clock. 

A clock of programmable speed supplied to the external interface. 

A delayed copy of sys_clk_outl_h,l. The delay is programmable 
and is an integer number of cpu_clk_out_h periods. 



Figure 6 shows the 21164 clock signals. 



Figure 6 Alpha 21164 Clock Signals 
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5.1.1 CPU Clock 

The 21164 uses the differential input clocl< lines osc_clk_in_h,l as a source 
to generate its CPU clock. The input signals clk_mode_h<l:0> control 

generation of the CPU clocl<. 

5.1.2 System Clock 

The CPU clocl< is divided by a programmable value of between 3 and 15 
to generate a system clock. The programmable feature allows the system 
designer maximum flexibility when choosing external logic to interface with 
the 21164. 

The sys_clk_outl_h,l signals are delayed by a programmable number of 
CPU cycles between and 7 to produce sys_clk_out2_h,l. The output of 
the programmable divider is symmetric if the divisor is even. The output is 
asymmetric if the divisor is odd. 

Figure 7 shows the 21164 driving the system clock on a uniprocessor system. 
Figure 7 Alpha 21164 Uniprocessor Clock 
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5.1.3 Reference Clock 

The 21164 provides a reference clock input so that other CPUs and system 
devices can be synchronized in multiprocessor systems. If a clock is asserted 
on signal ref_clk_in_h, then thesys_clk_outl_h,l signals are synchronized to 
that reference clock by means of a digital phase-locked loop (DP LL). Figures 
shows the 21164 synchronized to a system reference clock. 

Figure 8 Alpha 21164 Reference Clock for Multiprocessor Systems 
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5.2 Board-Level Backup Cache Interface 

The 21164 includes an interface and control for an optional board-level backup 
cache (Bcache). This section describes the Bcache interface. The Bcache 
interface is made up of the following: 

• A data bus (which it shares with the system interface) 

• Tag and tag control bits for determining hit and coherence 

• SRAM output and SRAM write control signals 
Figure 9 shows the 21164 system interface signals. 

Figure 9 Alpha 21164 Bcache Interface Signals 
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The Bcache interface is managed by the cache control and bus interface unit 
(Cbox). The Bcache interface is a 128-bit bidirectional data bus. The read 
and write speed of the Bcache can be programmed independently of each 
other and independently of the system clock ratio. Optionally, the Bcache 
can operate in a psuedo-pipeline manner. Internal processor registers are 
used to program the Bcache timing and to enable wave pipelining. See the 
Alpha 21164 Microprocessor H ardware Rd'erence M anual for more information. 

The Bcache system supports blocl< sizes of 32 or 64 bytes but it be must set 
lil<e the secondary cache (Scache). The blocl< size is selected by a mode bit. 
The Scache is 3-way, set-associative but is a subset of the larger externally 
implemented, direct-mapped Bcache. In systems with no Bcache, the Scache 
block size must be set to 64 bytes. 

5.2.1 Bcache Victim Buffers 

The 21164 is designed to support systems with one or more offchip Bcache 
victim buffers. External victim buffers improve the overall performance of 
the Bcache. A Bcache victim is generated when the 21164 deallocates a dirty 
block from the Bcache. Each time a Bcache victim is produced, the 21164 stops 
reading the Bcache until the system takes the current victim, and then the 
Bcache operations resume. 
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5.2.2 Cache Coherence Protocol 

Cache coherency is a concern for single and multiprocessor 21164-based 
systems as there may be several caches on a processor module and several 
more in multiprocessor systems. 

The system hardware designer need not be concerned about I cache and Dcache 
coherency. Coherency of the I cache is a software concern— it is flushed with 
an 1MB (PALcode) instruction. The 21164 maintains coherency between the 
Dcache and the Scache. 

If the system does not have a Bcache, the system designer must create 
mechanisms in the system interface logic to support cache coherency between 
the Scache, main memory, and other caches in the system. 

If the system has a Bcache, the 21164 maintains cache coherency between the 
Scache and the Bcache. The Scache is a subset of the Bcache. I n this case, 
the designer must create mechanisms in the system interface logic to support 
cache coherency between the Bcache, main memory, and other caches in the 
system. 

The following tasks must be performed to maintain cache coherency: 

• The Cbox in the 21164 maintains coherency in the Dcache and keeps it as 
a subset of the Scache. 

• If an optional Bcache is present, then the 21164 maintains the Scache as a 
subset of the Bcache. The Scache is set-associative but is kept a subset of 
the larger externally implemented direct-mapped Bcache. 

• System logic must help the 21164 to keep the Bcache coherent with main 
memory and other caches in the system. 

• The I cache is not a subset of any cache and also is not kept coherent with 
the memory system. 

Table 4 describes the Bcache states that determine cache coherence protocol for 
21164 systems. 
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Table 4 Bcache States for Cache Coherency Protocols 
Valid^ Shared^ Dirty^ State of Cache Line 

X X Not valid. 

10 Valid for read or write operations. This cache line 

contains the only cached copy of the block and the 
copy in memory is identical to this line. 

10 1 Valid for read or write operations. This cache line 

contains the only cached copy of the block. The 
contents of the block have been modified more recently 
than the copy in memory. 

110 Valid for read or write operations. This block may be 

in another CPU's cache. 

111 Valid for read or write operations. This block may 

be in another CPU's cache. The contents of the block 
have been modified more recently than the copy in 
memory. 

^Thetag_valid_h, tag_shared_h, and tag_dirty_h signals are described in Table 2. 
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5.3 System Interface 

The system interface is made up of bidirectional address and command buses, 
a data bus that it shares with the Bcache interface, and several control signals. 

Figure 10 shows the 21164 system interface signals. 
Figure 10 Alpha 21164 System Interface Signals 
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The system interface is under the control of the cache control and bus interface 
unit (Cbox). The system interface is a 128-bit bidirectional data bus. The 
cycle time of the system interface is programmable to speeds of one-third to 
one-fifteenth the CPU cycle time. All system interface signals are driven or 
sampled by the 21164 on the rising edge of sys_clk_outl_h. 

5.3.1 Commands and Addresses 

The 21164 can take up to two commands from the system at a time. The 
bus interface buffer can hold one or two misses and one or two Scache victim 
addresses at a time. A miss occurs when the 21164 searches its caches but 
does not find the addressed block. The 21164 can queue two misses to the 
system. An Scache victim occurs when the 21164 deallocates a dirty block from 
the Scache. 

The system requests the misses, and the victims arbitrate for the Bcache. 

• The highest priority for the Bcache is data movement for the system, which 
includes fill, read dirty data, invalidate, and set shared activities. 
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• If there are no system requests for the Bcache, then a 21164 command is 
selected. 

Tables 5 and 6 provide a brief description of the commands that the 21164 and 
the system can drive on the command bus. 



Table 5 Alpha 21164 Commands for the System 



cmd<3:0> 



Command 



Meaning 



0000 


NOP 


0001 


LOCK 


0010 


FETCH 


0011 


FETCH_M 


0100 


MEMORY BARRIER 


0101 


SET DIRTY 


Olio 


WRITE BLOCK 


0111 


WRITE BLOCK LOCK 


1000 


READ MISSO 


1001 


READ MISSl 


1010 


READ MISS MODO 


1011 


READ MISS MODI 


1100 


BCACHE VICTIM 


1101 


— 


1110 


READ MISS MOD 




STCO 


1111 


READ MISS MOD 




STCl 



Nothing. 

New lock register address. 

21164 passes a FETCH to system. 

21164 passes a FETCH_M to system. 

MB instruction. 

Dirty bit set if shared bit is clear. 

Request to write a block. 

Request to write a block with lock. 

Request for data. 

Request for data. 

Request for data; modify intent. 

Request for data; modify intent. 

Bcache victim should be removed. 

Spare. 

Request for data, STx_C data. 

Request for data, STx_C data. 
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Table 6 System Commands for the 21164 



cmd<3:0> 


Command 


Meaning 


0000 


NOP 


Nothing. 


0001 


FLUSH 


Remove block from caches; return dirty data 
(flush protocol). 


0010 


INVALIDATE 


Remove the block (write invalidate protocol). 


0011 


SET SHARED 


Block goes to the shared state (write invalidate 
protocol). 


0100 


READ 


Read a block (flush protocol). 


0101 


READ DIRTY 


Read a block; set shared (write invalidate 
protocol). 


0111 


READ DIRTY/I NV 


Read a block; invalidate (write invalidate 
protocol). 
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5.4 Interrupts 

The 21164 has seven interrupt signals that have different uses during 
initialization and normal operation. 

Figure 11 shows the 21164 interrupt signals. 
Figure 11 Alpha 21164 Interrupt Signals 
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5.4.1 Interrupt Signals During Initialization 

The 21164 interrupt signals work in tandem with the sys_reset_l signal to set 
the values for many of the user-selectable clocking ratios and interface timing 
parameters. During initialization, the 21164 reads system clock configuration 
parameters from the interrupt pins. 
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Table 7 shows the system clock divisor settings. The system clock frequency is 
determined by dividing the ratio into the CPU clock frequency. 

Table 7 System Clock Divisor 



irq_h<3> irq_h<2> lrq_h<1> irq_h<0> 



Ratio 



Low 


Low 


High 


High 


3 


Low 


High 


Low 


Low 


4 


Low 


High 


Low 


High 


5 


Low 


High 


High 


Low 


6 


Low 


High 


High 


High 


7 


High 


Low 


Low 


Low 


8 


High 


Low 


Low 


High 


9 


High 


Low 


High 


Low 


10 


High 


High 


High 


High 


15 



Table 8 shows how the three remaining interrupt signals are used to determine 
the length of the sys_clk_out2 delay. These signals provide flexible timing for 
system use. 



Table 8 System Clock Delay 



sy s_mc h_ch k_i rq_h 



pwr_fail_ 


Jrq_h 


mch_ 


halt. 


jrq_h 


Delay Cycles 


Low 




Low 









Low 




High 






1 


High 




Low 






2 


High 




High 






3 


Low 




Low 






4 


Low 




High 






5 


High 




Low 






6 


High 




High 






7 



Low 
Low 
Low 
Low 
High 
High 
High 
High 
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5.4.2 Interrupt Signals During Normal Operation 

During normal operation, interrupt signals request various interrupts as 
described in Table 2. 

5.5 Test Modes 

F igure 12 shows the 21164 test signals. 

Figure 12 Alpha 21164 Test Signals 
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The 21164 test interface port consists of 13 dedicated signals. Table 9 
summarizes the 21164 test port signals and their function. 



Table 9 Alpha 21164 Test Port Pins 



Pin Name 



Type Function 



portmodeh <1> 


1 


portmodeh <0> 


1 


srompresentj 


1 


srom_data_h/Rx 


1 


sromclkh/Tx 





srom_oe_l 





tdi_h 


1 


tdo_h 





tms_h 


1 


tck_h 


B 


trst_l 


1 


teststatush <0> 





test status h<l> 






Must be false. 

Must be false. 

Tied low if serial ROMs (SROMs) are present in 
system. 

Receives SROM or serial terminal data. 

Supplies clock to SROMs or transmits serial 
terminal data. 

SROM enable. 

IEEE 1149.1 TDI port. 

IEEE 1149.1 TDO port. 

IEEE 1149.1 TMS port. 

IEEE 1149.1 TCK port. 

IEEE 1149.1 optional TRST port. 

Indicates I cache BiSt status. 

Outputs an I PR-written value and timeout reset. 



5.5.1 Normal Test Interface Mode 

The test port is in the default or normal test interface mode when the 
port_mode_h<l:0> signals are tied to 00. In this mode, the test port supports 
the following: 

• Serial ROM interface port 

• Serial diagnostic terminal i nterf ace port 

• IEEE 1149.1 test access port 

5.5.2 Serial ROM Interface Port 

The following signals make up the serial ROM (SROM) interface: 

srompresentj 
sromdatah 
srom_oe_l 
srom elk h 



Preliminary — Subject to Change — July 1996 47 



During system reset, the 21164 samples the srom_present_l signal for the 
presence of SROM. If no SROMs are detected at reset, then srom_present_l 
is deasserted and the SROM load is disabled. The reset sequence clears the 
I cache valid bits, which causes the first instruction fetch to miss the I cache and 
seek instructions from offchip memory. 

If SROMs are present during setup, then the system performs an SROM load 
as follows: 

1. Thesrom_oe_l signal supplies the output enable to the SROM. 

2. The srom_clk_h signal supplies the clock to the ROM that causes it to 
advance to the next bit. The cycle time of this clock is 126± times the 
system clock ratio. 

3. The srom_data_h signal reads the SROM data. 

5.5.3 Serial Terminal Port 

After the serial ROM data is loaded into the I cache, the three SROM load 
signals become parallel I/O pins that can drive a diagnostic terminal such as 
an RS422. 

5.5.4 IEEE 1149.1 Test Access Port 

The test access port complies with all requirements of the IEEE 1149.1 (J TAG) 
standard. The following signals make up the test access port: 

• tms_h— Test access port select. 

• trst_l— Test access port reset. 

• tck_h— Test access port clock. 

• tdi_h and tdo_h— Input and output for serial boundary scan, die-ID, 
bypass, and instruction registers. 

5.5.5 Test Status Signals 

Thetest_status_h signals extract test status information from the chip. 

• The test_status_h<0> signal indicates when the I cache built-in self-test 
(BiSt) fails. 

• The test_status_h<l> signal detects unrepairable Icache by indicating 
more than two failing Icache rows. 
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6 Alpha Architecture Basics 

This section provides some basic information about the Alpha architecture. 
For more detailed information about the Alpha architecture, seethe/4/pha 
Architecture R dterence Manual. 

6.1 The Architecture 

The Alpha architecture is a 64-bit load and store RISC architecture designed 
with particular emphasis on speed, multiple instruction issue, multiple 
processors, and software migration from many operating systems. 

All registers are 64 bits in length and all operations are performed between 
64-bit registers. All instructions are 32 bits in length. Memory operations 
are either load or store operations. All data manipulation is done between 
registers. 

The Alpha architecture supports the following data types: 

• 8-, 16-, 32-, and 64-bit integers 

• IEEE 32-bit and 64-bit floating-point formats 

• VAX architecture 32-bit and 64-bit floating-point formats 

In the Alpha architecture, instructions interact with each other only by one 
instruction writing to a register or memory location and another instruction 
reading from that register or memory location. This use of resources makes 
it easy to build implementations that issue multiple instructions every CPU 
cycle. 

The 21164 uses a set of subroutines, called privileged architecture library 
code (PALcode), that is specific to a particular Alpha operating system 
implementation and hardware platform. These subroutines provide operating 
system primitives for context switching, interrupts, exceptions, and memory 
management. These subroutines can be invoked by hardware or CALL_PAL 
instructions. CALL_PAL instructions use the function field of the instruction 
to vector to a specified subroutine. PALcode is written in standard machine 
code with some implementation-specific extensions to provide direct access to 
low-level hardware functions. PALcode supports optimizations for multiple 
operating systems, flexible memory-management implementations, and 
multi-instruction atomic sequences. 

The Alpha architecture performs byte shifting and masking with normal 64-bit, 
register-to-register instructions; it does not include single-byte load and store 
instructions. 
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6.2 Addressing 

The basic addressable unit in the Alpha architecture is the 8-bit byte. The 
21164 supports a 43-bit virtual address. 

Virtual addresses as seen by the program are translated into physical memory 
addresses by the memory-management mechanism. The 21164 supports a 
40-bit physical address. 

6.3 Integer Data Types 

Alpha architecture supports four integer data types: 



Data Type 



Description 



Byte A byte is 8 contiguous bits that start at an addressable byte boundary. 

A byte is an 8-bJt value. A byte is supported in Alpha architecture by 
the EXTRACT, MASK, INSERT, and ZAP instructions. 

Word A word is 2 contiguous bytes that start at an arbitrary byte boundary. 

A word is a 16-bit value. A word is supported in Alpha architecture by 
the EXTRACT, MASK, and INSERT instructions. 

Longword A longword is 4 contiguous bytes that start at an arbitrary byte 

boundary. A longword is a 32-bit value. A longword is supported in 
the Alpha architecture by sign-extended load and store instructions and 
by longword arithmetic instructions. 

Quadword A quadword is 8 contiguous bytes that start at an arbitrary byte 

boundary. A quadword is supported in Alpha architecture by load and 
store instructions and quadword integer operate instructions. 



Note 



Alpha implementations may Impose a significant performance penalty 
when accessing operands that are not NATURALLY ALIGNED. Refer 
to the Alpha Architecture Reference M anual for details. 
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6.4 Floating-Point Data Types 

The 21164 supports the following floating-point data types: 

• Longword integer format in floating-point unit 

• Quadword integer format in floating-point unit 

• IEEE floati ng-poi nt formats 

S_floati ng 
- T_floating 

• VAX floating-point formats 

F_floating 
G_floating 
D_floating (limited support) 
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7 Alpha 21164 Microprocessor IEEE Floating-Point 
Conformance 

The 21164 supports the IEEE floating-point operations as defined by the Alpha 
architecture. Support for a complete implementation of the IEEE Standard 
for Binary Floating-Point Arithmetic (ANS\/\EEE Standard 754 1985) is 
provided by a combination of hardware and software as described in the/4/p/ia 
Architecture Reference Manual. 

Additional information about writing code to support precise exception 
handling (necessary for completeconformanceto the standard) is in the Alpha 
Architecture Reference Manual. 

The following information is specific to the 21164: 

• Invalid operation (INV) 

The invalid operation trap is always enabled. If the trap occurs, then the 
destination register is UNPREDICTABLE. This exception is signaled if any 
VAX architecture operand is nonfinite (reserved operand or dirty zero) and 
the operation can take an exception (that is, certain instructions, such as 
CPYS, never take an exception). This exception is signaled if any IEEE 
operand is nonfinite (NAN, INF, denorm) and the operation can take an 
exception. This trap is also signaled for an IEEE format divide of 4/- 
divided by +/- 0. If the exception occurs, then FPCR<INV> is set and the 
trap is signaled to the I box. 

• Divide-by-zero (DZE) 

The divide-by-zero trap is always enabled. I f the trap occurs, then the 
destination register is UNPREDICTABLE. For VAX architecture format, 
this exception is signaled whenever the numerator is valid and the 
denominator is zero. For IEEE format, this exception is signaled whenever 
the numerator is valid and non-zero, with a denominator of -h/- 0. If the 
exception occurs, then FPCR<DZE>is set and the trap is signaled to the 
I box. 

For IEEE format divides, 0/0 signals INV, not DZE. 

• F loati ng overflow (OVF ) 

The floating overflow trap is always enabled. If the trap occurs, then the 
destination register is UNPREDICTABLE. The exception is signaled if the 
rounded result exceeds in magnitude the largest finite number, which can 
be represented by the destination format. This applies only to operations 
whose destination is a floating-point data type. If the exception occurs, 
then FPCR<OVF>is set and the trap is signaled to the I box. 
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U nderflow (U N F ) 

The underflow trap can be disabled. If underflow occurs, then the 
destination register is forced to a true zero, consisting of a full 64 bits of 
zero. This is done even if the proper IEEE result would have been -0. The 
exception is signaled if the rounded result is smaller in magnitude than the 
smallest finite number that can be represented by the destination format. 
If the exception occurs, then FPCR<UNF> is set. If the trap is enabled, 
then the trap is signaled to the I box. The 21164 never produces a denormal 
number; underflow occurs instead. 

Inexact (INE) 

The inexact trap can be disabled. The destination register always contains 
the properly rounded result, whether the trap is enabled. The exception 
is signaled if the rounded result is different from what would have been 
produced if infinite precision (infinitely wide data) were available. For 
floating-point results, this requires both an infinite precision exponent and 
fraction. For integer results, this requires an infinite precision integer and 
an integral result. If the exception occurs, then FPCR<INE>is set. If the 
trap is enabled, then the trap is signaled to the I box. 

The I EEE-754 specification allows INE to occur concurrently with either 
OVF or UN F. Whenever OVF is signaled (if the inexact trap is enabled), 
INE is also signaled. Whenever UNF is signaled (if the inexact trap is 
enabled), INE is also signaled. The inexact trap also occurs concurrently 
with integer overflow. All valid opcodes that enable INE also enable both 
overflow and underflow. 

If a CVTQL results in an integer overflow (lOV), then FPCR<INE>is 
automatically set. (The INE trap is never signaled to the I box because 
there is no CVTQL opcode that enables the inexact trap.) 

I nteger overflow (I OV) 

The integer overflow trap can be disabled. The destination register 
always contains the low-order bits (<54>or <32>) of the true result (not 
the truncated bits). Integer overflow can occur with CVTTQ, CVTGQ, 
or CVTQL. I n conversions from floating to quadword integer or longword 
integer, an integer overflow occurs if the rounded result is outside the range 
-2^^ ..2^^"^. In conversions from quadword integer to longword integer, 
an integer overflow occurs if the result is outside the range -2^^ ..2^^~^. If 
the exception occurs, then the appropriate bit in the floating-point control 
register (FPCR) is set. If the trap is enabled, then the trap is signaled to 
the I box. 
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• Software completion (SWC) 

The software completion signal is not recorded in the FPCR. The state 
of this signal is always sent to the I box. If the I box detects the assertion 
of any of the listed exceptions concurrent with the assertion of the SWC 
signal, then it sets EXC_SUM<SWC>. 

I nput exceptions always take priority over output exceptions. If both exception 
types occur, then only the input exception is recorded in the FPCR and only 
the input exception is signaled to the I box. 
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8 Internal Processor Registers 

This section describes tine 21164 microprocessor internal processor registers 
(IPRs). It is organized as follows: 

Instruction fetch/decode unit and branch unit (I box) IPRs 

Memory address translation unit (Mbox) IPRs 

Cache control and bus interface unit (Cbox) IPRs 

PAL storage registers 

Restrictions 

I box, M box, data cache (Dcache), and PALtemp I PRs are accessible to PALcode 
by means of the HW_MTPR and HW_MFPR instructions. Table 10 lists the 
I PR numbers for these instructions. 

Cbox, second-level cache (Scache), and backup cache (Bcache) IPRs are 
accessible in the physical address region FF FFFO 0000 toFF FFFF FFFF. 
Table 34 summarizes the Cbox, Scache, and Bcache IPRs. Table 47 lists 
restrictions on the I PRs. 

Note for Windows NT 



For 21164-Pl and 21164-P2 users, the following bits must be set: 

• I BOX control and status register (ICSR<28>) SPE<0>must always 
be set (Section 8.1.17). Clearing this bit will cause 21164-Pn 
operation to be UNPREDICTABLE. 

• MBOX control register (MCSR<01>) SP<0> must always be set 
(Section 8.2.14). Clearing this bit will cause 21164-Pn operation to 
beUNPREDICTABLE. 



Note 



Unless explicitly stated, IPRs are not cleared or set by hardware on 
chip or timeout reset. 
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Table 10 Ibox, Mbox, Dcache, and PALtemp IPR Encodings 



IPR Mnemonic 



Access 



Indexi 



Ibox Slots to Pipe 



IboxIPRs 








ISR 


R 


100 


El 


ITB_TAG 


W 


101 


El 


ITB_PTE 


R/W 


102 


El 


ITB_ASN 


R/W 


103 


El 


ITB_PTE_TEIV1P 


R 


104 


El 


ITBJA 


W 


105 


El 


ITBJAP 


W 


106 


El 


ITBJS 


w 


107 


El 


SIRR 


R/W 


108 


El 


A5TRR 


R/W 


109 


El 


ASTER 


R/W 


lOA 


El 


EXC_ADDR 


R/W 


lOB 


El 


EXC_SUM 


R/WOC 


IOC 


El 


EXC_MASK 


R 


lOD 


El 


PAL_BASE 


R/W 


lOE 


El 


ICM 


R/W 


lOF 


El 


IPLR 


R/W 


110 


El 


INTID 


R 


111 


El 


IFAULT_VA_FORM 


R 


112 


El 


IVPTBR 


R/W 


113 


El 


HWINT_CLR 


W 


115 


El 


SL_XMIT 


W 


116 


El 


SL_RCV 


R 


117 


El 


ICSR 


R/W 


118 


El 


IC_FLUSH_CTL 


W 


119 


El 


ICPERR_STAT 


R/WIC 


llA 


El 


PMCTR 


R/W 


lie 


El 
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Table 10 (Cont.) 


Ibox, Mbox, Dcache, and 


PALtemp IPR Encodings 


IPR Mnemonic 


Access 


Indexi6 


Ibox Slots to Pipe 


PALtemplPRs 








PAL tempo 


R/W 


140 


El 


PAL tempi 


R/W 


141 


El 


PALtemp2 


R/W 


142 


El 


PALtempB 


R/W 


143 


El 


PALtemp4 


R/W 


144 


El 


PALtempS 


R/W 


145 


El 


PALtempe 


R/W 


146 


El 


PALtemp7 


R/W 


147 


El 


PALtempS 


R/W 


148 


El 


PALtemp9 


R/W 


149 


El 


PAL temp 10 


R/W 


14A 


El 


PALtempll 


R/W 


14B 


El 


PAL temp 12 


R/W 


14C 


El 


PAL temp 13 


R/W 


14D 


El 


PAL temp 14 


R/W 


14E 


El 


PAL temp 15 


R/W 


14F 


El 


PAL temp 15 


R/W 


150 


El 


PALtempl? 


R/W 


151 


El 


PALtemplS 


R/W 


152 


El 


PAL temp 19 


R/W 


153 


El 


PALtemp20 


R/W 


154 


El 


PALtemp21 


R/W 


155 


El 


PALtemp22 


R/W 


156 


El 


PALtemp23 


R/W 


157 


El 


MboxIPRs 








DTB_ASN 


W 


200 


EO 


DTB CM 


W 


201 


EO 



(continued on next page) 
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Table 10 (Cont.) Ibox, Mbox, Dcache, and PALtemp IPR Encodings 



IPR Mnemonic 



Access 



Indexi 



Ibox Slots to Pipe 



DTB_TAG 


W 


202 


EO 


DTB_PTE 


R/W 


203 


EO 


DTB_PTE_TEMP 


R 


204 


EO 


MM_STAT 


R 


205 


EO 


VA 


R 


206 


EO 


VA_FORM 


R 


207 


EO 


MVPTBR 


W 


208 


EO 


DTBJAP 


W 


209 


EO 


DTBJA 


W 


20A 


EO 


DTBJS 


W 


20B 


EO 


ALT_MODE 


W 


20C 


EO 


CC 


W 


20D 


EO 


CC_CTL 


w 


20E 


EO 


MCSR 


R/W 


20F 


EO 


DC_FLUSH 


W 


210 


EO 


DC_PERR_STAT 


R/WIC 


212 


EO 


DC_TEST_CTL 


R/W 


213 


EO 


DC_TEST_TAG 


R/W 


214 


EO 


DC_TEST_TAG_TEMP 


R/W 


215 


EO 


DC_MODE 


R/W 


216 


EO 


MAF MODE 


R/W 


217 


EO 
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8.1 Instruction Fetch/Decode Unit and Brancli Unit (Ibox) IPRs 

The Ibox internal processor registers (IPRs) are described in Section 8.1.1 
through Section 8.1.27. 

8.1.1 Istream Translation Buffer Tag Register (ITBTAG) 

ITB_TAG is a write-only register written by hardware on an 
ITBMISS/IACCVIO, with the tag field of the faulting virtual address. To 
ensure the integrity of the instruction translation buffer (ITB), the TAG and 
page table entry (PTE) fields of an ITB entry are updated simultaneously by 
a write operation to the ITB_PTE register. This write operation causes the 
contents of the ITB_TAG register to be written into the tag field of the ITB 
location, which is determined by a not-last-used replacement algorithm. The 
PTE field is obtained from the HW_MTPR ITB_PTE instruction. Figure 13 
shows the ITB_TAG register format. 

Figure 13 Istream Translation Buffer Tag Register (ITBTAG) 
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8.1.2 Instruction Translation Buffer Page Table Entry (ITBPTE) Register 

ITB_PTE is a read/write register. 

Write Format 

A write operation to this register writes botin tine PTE and TAG fields of an 
ITB location determined by a not-last-used replacement algorithm. The TAG 
and PTE fields are updated simultaneously to ensure the integrity of the ITB. 
A write operation to the I TB_PTE register increments the not-last-used (NLU) 
pointer, which allows for writing the entire set of ITB PTE and TAG entries. 
If the H W_MTPR ITB_PTE instruction falls in the shadow of a trapping 
instruction, the NLU pointer may be incremented multiple times. The TAG 
field of the ITB location is determined by the contents of the ITB_TAG register. 
The PTE field is provided by the HW_MTPR ITB_PTE instruction. Write 
operations to this register use the memory format bits, as described in the 
Alpha Architecture Rd'erence Manual. Figure 14 shows the ITB_PTE register 
write format. 



Figure 14 Instruction Translation Buffer Page Table Entry (ITBPTE) Register 
Write Format 
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Read Format 

A read of the ITB_PTE requires two instructions. A read of the ITB_PTE 
register returns the PTE pointed to by the NLU pointer to the ITB_PTE_ 
TEMP register and increments the NLU pointer. If the HW_MFPR ITB_PTE 
instruction falls in the shadow of a trapping instruction, the NLU pointer 
may be incremented multiple times. A zero value is returned to the integer 
register file. A second read of the ITB_PTE_TEMP register returns the PTE to 
the general purpose integer register file (IRF). Figure 15 shows the ITB_PTE 
register read format. 
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Figure 15 Instruction Translation Buffer Page Table Entry (ITBPTE) Register Read 
Format 
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8.1.3 Instruction Translation Buffer Address Space Number (ITBASN) Register 

ITB_A5N is a read/write register that contains tine address space number 
(ASN) of the current process. Figure 16 sinows tine ITB_ASN register format. 

Figure 16 Instruction Translation Buffer Address Space Number (ITBASN) 
Register 
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8.1.4 Instruction Translation Buffer Page Table Entry Temporary (ITBPTETEMP) 
Register 

ITB_PTE_TEMP is a read-only holding register for ITB_PTE read data. A read 
of the ITB_PTE register returns data to this register. A second read of the 
ITB_PTE_TEMP register returns data to the general purpose integer register 
file (IRF). Figure 15 shows the ITB_PTE register format. 

Table 11 shows the GHD settings for the I TB_PTE_TE MP register. 

Table 11 Granularity Hint Bits in ITB_PTE_TEMP Read Format 
Name Extent Type Description 

Set if granularity hint equals 01, 10, or 11. 
Set if granularity hint equals 10 or 11. 
Set if granularity hint equals 11. 

8.1.5 Instruction Translation Buffer Invalidate All Process (ITBJAP) Register 

ITB_IAP is a write-only register. Any write operation to this register 
invalidates all ITB entries that have an address space match (ASM) bit that 
equals zero. 

8.1.6 Instruction Translation Buffer Invalidate All (ITBJA) Register 

ITB_IA is a write-only register. A write operation to this register invalidates 
all ITB entries, and resets the ITB not-last-used (NLU) pointer to its initial 
state. RESET PALcode must execute an HW_MTPR ITB_IA instruction in 
order to initialize the NLU pointer. 



GHD 
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RO 


GHD 


<30> 


RO 


GHD 


<31> 


RO 
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8.1.7 Instruction Translation Buffer IS (ITBJS) Register 

ITB_IS is a write-only register. Writing a virtual address to this register 
invalidates tlie I TB entry tii at meets eitiier of tiie following criteria: 

• An ITB entry whose virtual address (VA) field matches ITB_IS<42:13>and 
whose ASN field matches ITB_ASN<10:04>. 

• An ITB entry whose VA field matches ITB_IS<42:13>and whose ASM bit 
is set. 

Figure 17 shows the ITB_IS register format. 

Figure 17 Instruction Translation Buffer IS (ITBJS) Register 
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8.1.8 Formatted Faulting Virtual Address (IFAULT_VA_FORM) Register 

IFAULT_VA_FORM is a read-only register containing the formatted faulting 
virtual address on an ITBMISS/IACCVIO (except on lACCVIOs generated 
by sign-check errors). The formatted faulting address generated depends 
on whether NT superpage mapping is enabled through ICSR bit SPE<0>. 
Figure 18 shows the IFAULT_VA_FORM register format in non-NT mode. 

Figure 18 Formatted Faulting Virtual Address (IFAULT_VA_FORM) Register 
(NT_Mode=0) 
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Figure 19 shows the I FAULT_VA_FORM register format in NT mode. 

Figure 19 Formatted Faulting Virtual Address (IFAULT_VA_FORM) Register 
(NT_Mode=1) 
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8.1.9 Virtual Page Table Base Register (IVPTBR) 

IVPTBR is a read/write register. Bits <32:30> are UNDEFINED on a read of 
tiiis register in non-NT mode. Figure 20 sinows tine IVPTBR format in non-NT 
mode. 

Figure 20 Virtual Page Table Base Register (IVPTBR) (NT_Mode=0) 
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Figure 21 shows the IVPTBR format in NT mode. 



Figure 21 Virtual Page Table Base Register (IVPTBR) (NT_Mode=1) 
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8.1.10 Icache Parity Error Status (ICPERR_STAT) Register 

ICPERR_STAT is a read/write register. Tine icaclne parity error status bits 
may be cleared by writing a 1 to the appropriate bits. Figure 22 and Table 12 
describe the ICPERR_STAT register format. 

Figure 22 Icache Parity Error Status (ICPERR_STAT) Register 
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Table 12 Icache Parity Error Status Register Fields 



Name Extent Type Description 



DPE 


<11> 


WIC 


Data parity error 


TPE 


<12> 


WIC 


Tag parity error 


TMR 


<13> 


WIC 


Timeout reset error or cfail_h/no cack_h error 



8.1.11 Icache Flush Control (IC_FLUSH_CTL) Register 

IC_FLUSH_CTL is a write-only register. Writing any value to this register 
flushes the entire Icache. 
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8.1.12 Exception Address (EXC_ADDR) Register 

EXC_ADDR is a read/write register used to restart the system after exceptions 
or interrupts. Tine HW_REI instruction causes a return to tine instruction 
pointed to by the EXC_ADDR register. This register can be written both 
by hardware and software. Hardware write operations occur as a result of 
exceptions/interrupts and CALL_PAL instructions. Hardware write operations 
that occur as a result of exceptions/interrupts take precedence over all other 
write operations. 

In case of an exception/interrupt, hardware writes a program counter (PC) 
to this register. I n case of precise exceptions, this is the PC value of the in- 
struction that caused the exception. I n case of imprecise exceptions/interrupts, 
this is the PC value of the next instruction that would have issued if the 
exception/interrupt was not reported. 

In case of a CALL_PAL instruction, the PC value of the next instruction after 
the CALL_PAL is written to EXC_ADDR. 

Bit <00> of this register is used to indicate PALmode. On a HW_REI 
instruction, the mode of the system is determined by bit <00>of EXC_ADDR. 
Figure 23 shows the EXC_ADDR register format. 

Figure 23 Exception Address (EXC_ADDR) Register 
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8.1.13 Exception Summary (EXC_SUM) Register 

EXC_SUM is a read/write register that records the different arithmetic traps 
that occur between EXC_SUIM write operations. Any write operation to this 
register clears bits <16:10>. Figure 24 and Table 13 describe the EXC_SUM 
register format. 

Figure 24 Exception Summary (EXC_SUI\/I) Register 
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Table 13 Exception Summary Register Fields 



Name 



Extent 



Type 



Description 



SWC 



<10> 



WA 



Indicates software completion possible. This bit 
is set after a floating-point instruction containing 
the/S modifier completes with an arithmetic trap 
and If all previous floating-point instructions that 
trapped since the last HW_MTPR EXC_SUM 
instruction also contained the/S modifier. 

The SWC bit is cleared whenever a floating-point 
instruction without the/S modifier completes 
with an arithmetic trap. The bit remains cleared 
regardless of additional arithmetic traps until the 
register is written by an HW_MTPR Instruction. 
The bit is always cleared upon any HW_MTPR 
write operation to the EXC_SUM register. 

(continued on next page) 
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Table 13 (Cont.) Exception Summary Register Fields 
Name Extent Type Description 

Indicates invalid operation. 

I ndicates divide by zero. 

Indicates floating-point overflow. 

Indicates floating-point underflow. 

Indicates floating inexact error. 

Indicates floating-point execution unit (Fbox) 
convert to integer overflow or integer arithmetic 
overflow. 



INV 


<11> 


WA 


DZE 


<12> 


WA 


FOV 


<13> 


WA 


UNF 


<14> 


WA 


INE 


<15> 


WA 


lOV 


<16> 


WA 
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8.1.14 Exception Mask (EXC_MASK) Register 

EXC_MASK is a read/write register tinat records the destinations of 
instructions tinat in a ve caused an aritinmetic trap between EXC_IMASK write 
operations. Tine destination is recorded as a single bit masl< in tine 64-bit I PR 
representing F0-F31 and 10-131. A write operation toEXC_SUIM clears tine 
EXC_MASK register. Figure 25 shows the EXC_M ASK register format. 

Figure 25 Exception Mask (EXC_MASK) Register 
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8.1.15 PAL Base Address (PAL_BASE) Register 

PAL_BASE is a read/write register containing the base address for PALcode. 
Tine register is cleared by hardware on reset. Figure 26 shows the PAL_BASE 
register format. 

Figure 26 PAL Base Address (PAL_BASE) Register 
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8.1.16 Ibox Current Mode (ICM) Register 

I CM is a read/write register containing the current mode bits of the 
architecturally defined processor status, as described in the Alpha Architecture 
Ref'erence Manual. Figure 27 shows the ICM register format. 

Figure 27 Ibox Current l\/lode (ICIUI) Register 
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8.1.17 Ibox Control and Status Register (ICSR) 

ICSR is a read/write register containing I box-related control and status 
information. Figure 28 and Table 14 describe ICSR format. 

Figure 28 Ibox Control and Status Register (ICSR) 
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Table 14 Ibox Control and Status Register Fields 



Name Extent Type Description 



PME<1:0> <09:08> RW,0 Performance counter masta" enable bits. If both 

PME<l>and PME<0> are clear, all performance 
counters in the PMCTR I PR are disabled. If 
either PME<l>or PME<0>areset, the counter is 
enabled according to the settings of the PMCTR 
CTL fields. 

IIV|SK<3:0> <23:20> RW,0 If set, each I MSK<3:0> signal disables the 

corresponding I RQ_H <3:0> interrupt. 

TMM <24> RW,0 If set, the timeout counter counts 5 thousand 

cycles before asserting timeout reset. If clear, 
tine timeout counter counts 1 billion cycles before 
asserting timeout reset. 

TMD <25> RW,0 If set, disables the Ibox timeout counter. Does 

not affect cfail_h/no cack_h error. 

FPE <26> RW,0 If set, floating-point instructions may be issued. 

If clear, floating-point instructions cause FEN 
exceptions. 

HWE <27> RW,0 If set, allows PALRES instructions to be issued 

in kernel mode. 

SPE<1:0> <29:28> RW,0 21164-266, 2Ue4^300, and 2U64-333 

If SPE<l>is set, it enables superpage mapping 
of I stream virtual address VA<39:13> directly 
to physical address PA <39:13> assuming 
VA<42:41> = 10. Virtual address bit VA<40> 
is ignored in this translation. Access is allowed 
only in kernel mode. 

If SPE<0> is set (NT mode), it enables 
superpage mapping of I stream virtual addresses 
VA<42:30> = IF F E 15 directly to physical address 
PA<39:30> = 0i6. VA<30:13>is mapped directly 
to PA<30:13>. Access is allowed only in kernel 
mode. 

21164-Pl and 21164-P2 

SPE<0>must always beset. Clearing this 
bit will cause 21164-Pn operation to be 
UNPREDICTABLE. 

(continued on next page) 
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Table 14 (Cont.) Ibox Control and Status Register Fields 



Name 



Extent 



Type 



Description 



SDE 


<30> 


RW,0 


CRDE 


<32> 


RW,0 


SLE 


<33> 


RW,0 


FMS 


<34> 


RW,0 


FBT 


<35> 


RW,0 


FBD 


<36> 


RW,0 


Resa"ved 


<37> 


RW,1 


ISTA 


<38> 


RO 



TST 



<39> 



RW,0 



If set, enables PAL shadow registers. 

If set, enables correctable error interrupts. 

If set, enables serial line interrupts. 

If set, forces miss on I cache references. MBZ in 
normal operation. 

If set, forces bad I cache tag parity. MBZ in 
normal operation. 

If set, forces bad I cache data parity. M BZ in 
normal operation. 

Reserved to Digital. Must be one. 

Reading this bit indicates I CACHE BIST status. 
If set, I CACHE BIST was successful. 

Writing a 1 to this bit asserts the 
test_status_h<l> signal . 
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8.1.18 Interrupt Priority Level Register (IPLR) 

I PLR is a read/write register tinat is accessed by PALcode to set the value of 
tine interrupt priority level (I PL). Whenever hardware detects an interrupt 
whose target I PL is greater than the value in IPLR<04:00>, an interrupt is 
taken. Figure 29 shows the IPLR register format. 

Figure 29 Interrupt Priority Level Register (IPLR) 
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8.1.19 Interrupt ID (INTID) Register 

INTID is a read-only register tinat is written by hardware witin the target 
I PL of the highest priority pending interrupt. The hardware recognizes an 
interrupt if the I PL being read is greater than the I PL given by IPLR<04:00>. 

I nterrupt service routines may use the value of this register to determine the 
cause of the interrupt. PALcode, for the interrupt service, must ensure that 
the I PL in I NTI D is greater than the I PL specified by I PLR. This restriction 
is required because a level -sensitive hardware interrupt may disappear before 
the interrupt service routine is entered (passive release). 

The contents of INTID are not correct on a HALT interrupt because this 
particular interrupt does not have a target I PL at which it can be masl<ed. 
When a HALT interrupt occurs, INTID indicates the next highest priority 
pending interrupt. PALcode for interrupt service must checl< the interrupt 
summary register (ISR) to determine if a HALT interrupt has occurred. 
F igure 30 shows the I NTI D register format. 

Figure 30 Interrupt ID (INTID) Register 
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8.1.20 Asynchronous System Trap Request Register (ASTRR) 

ASTRR is a read/write register containing bits to request asynchronous system 
trap (AST) interrupts in eacin of tine four processor modes (U,S,E,K). In order to 
generate an AST interrupt, tine corresponding enable bit in the ASTER must be 
set and the current processor mode given in the ICM<04:03>should be equal 
to or higher than the mode associated with the AST request. F igure 31 shows 
the ASTRR format. 

Figure 31 Asynchronous System Trap Request Register (ASTRR) 
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8.1.21 Asynchronous System Trap Enable Register (ASTER) 

ASTER is a read/write register containing bits to enable corresponding 
asyncl^ronous system trap (AST) interrupt requests. Figure 32 shows the 
ASTER format. 

Figure 32 Asynchronous System Trap Enable Register (ASTER) 



31 04 03 02 


01 00 




RAZ/IGN 
















L 


KAE 
EAE 

CAP 








UAE 



63 32 


RAZ/IGN 



LJ-03492-TI0 



80 Preliminary — Subject to Cliange — July 1996 



8.1.22 Software Interrupt Request Register (SIRR) 

SIRR is a read/write register used to control software interrupt requests. 
A software request for a particular I PL may be requested by setting tine 
appropriate bit in SIRR<15:01>. Figure 33 and TalDle 15 describe tine SIRR 
format. 

Figure 33 Software Interrupt Request Register (SIRR) 
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Table 15 


Software Interrupt Request Register Fields 


Name 


Extent 


Type 


Description 


SIRR<15:: 


> <18:04> 


RW 


Request software interrupts. 
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8.1.23 Hardware Interrupt Clear (HWINT_CLR) Register 

HWINT_CLR is a write-only register used to clear edge-sensitive hardware 
interrupt requests. Figure 34 and Table 16 describe tine HWINT_CLR register 
format. 

Figure 34 Hardware Interrupt Clear (HWINT_CLR) Register 
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Table 16 Hardware Interrupt Clear Register Fields 



Name 



Extent 



Type 



Description 



PCOC 


<27> 


WIC 


PCIC 


<28> 


WIC 


PC2C 


<29> 


WIC 


CROC 


<32> 


WIC 


SLC 


<33> 


WIC 



Clears performance counter interrupt requests. 
Clears performance counter 1 interrupt requests. 
Clears performance counter 2 interrupt requests. 
Clears correctable read data interrupt requests. 
Clears serial line interrupt requests. 
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8.1.24 Interrupt Summary Register (ISR) 

ISR is a read-only register containing information about all pending hardware, 
software, and asynchronous system trap (AST) interrupt requests. Figure 35 
and Table 17 describe the I SR format. 

Figure 35 Interrupt Summary Register (ISR) 
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Table 17 Interrupt Summary Register Fields 



Name 



Extent 



Type 



Description 



ASTRR<3:0> <03:00> 

and 

ASTER<3:0> 

SISR<15:1> <18:04> 



ATR 



<19> 



RO 



RO,0 



RO 



120 


<20> 


RO 


121 


<21> 


RO 


122 


<22> 


RO 


123 


<23> 


RO 


PCO 


<27> 


RO 


PCI 


<28> 


RO 


PC2 


<29> 


RO 


PFL 


<30> 


RO 


MCK 


<31> 


RO 


CRD 


<32> 


RO 


SLI 


<33> 


RO 


HLT 


<34> 


RO 



Boolean AND of ASTRR<JSEK> with 
ASTER^SEK>used to indicate enabled AST 
requests. 

Software interrupt requests 15 through 1 
corresponding to I PL 15 through 1. 

Set if any AST request and corresponding 
enable bit is set and if the processor mode is 
equal to or higher than the AST request mode. 

External hardware interrupt— irq_h<0>. 

External hardware interrupt— irq_h<l>. 

External hardware interrupt— irq_h<2>. 

External hardware interrupt— irq_h<3>. 

External hardware interrupt— performance 
counter 0(1 PL 29). 

External hardware interrupt— performance 
counter 1 (I PL 29). 

External hardware interrupt— performance 
counter 2 (I PL 29). 

External hardware interrupt— power failure 
(I PL 30). 

External hardware interrupt— system machine 
check (I PL 31). 

Correctable ECC errors (I PL 31). 

Serial line interrupt. 

External hardware interrupt— halt. 
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8.1.25 Serial Line Transmit (SL_XMIT) Register 

SL_XMIT is a write-only register used to transmit bit-serial data out of the 
microprocessor chip under the control of a software timing loop. The value 
of theTMT bit is transmitted offchip on the srom_clk_h signal. In normal 
operation mode (not in debugging mode), the srom_clk_h signal serves both 
the serial line transmission and the I cache serial ROM interface. Figure 36 
and Table 18 describe the SL_XM IT register format. 

Figure 36 Serial Line Transmit (SLXIUIIT) Register 
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Table 18 


Serial Line Transmit Register Fields 


Name 


Extent Type Description 


TMT 


<07> W0,1 Serial line transmit data 
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8.1.26 Serial Line Receive (SL_RCV) Register 

SL_RCV is a read-only register used to receive bit-serial data under the control 
of a software timing loop. The RCV bit in tine SL_RCV register is functionally 
connected to the srom_data_h signal. A serial line interrupt is requested 
whenever a transition is detected on the srom_data_h signal and the SLE 
bit in the ICSR is set. During normal operations (not in test mode), the 
srom_data_h signal serves both the serial line reception and the I cache serial 
ROM (SROM) interface. Figure 37 and Table 19 describe the SL_RCV register 
format. 

Figure 37 Serial Line Receive (SLRCV) Register 
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Table 19 Serial Line Receive Register Fields 



Name 



Extent 



Type 



Description 



RCV 



<06> 



RO 



Serial line receive data 
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8.1.27 Performance Counter (PMCTR) Register 

PMCTR is a read/write register that controls the three onchip performance 
counters. Figure 38 and Table 20 describe the PMCTR format. Performance 
counter interrupt requests are summarized in Section 8.1.24. Cbox inputs to 
the counter select options are described in Table 40. 

Note 



The arrangement of the select option tables is not meant to imply any 
restrictions on permitted combinations of selections. The only cases 
in which the selection for one counter influences another's count is 
SEL1=8(SEL 2=2, 3, other). 



Figure 38 Performance Counter (PMCTR) Register 
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Table 20 Performance Counter Register Fields 



Name 



Extent 



Type 



Description 



CTR0<15:0> <63:48> 



CTR1<15:0> 

SELO 

Ku 



<47:32> 

<31> 

<30> 



CTR2<13:0> <29:16> 
CTL0<1:0> <15:14> 



RW 

RW 
RW 
RW 

RW 
RW,0 



CTL1<1:0> <13:12> RW,0 



CTL2<1:0> <11:10> RW,0 



Kp 
Kk 



<09> RW 

<08> RW 



SEL1<3:0> <07:04> RW 
SEL2<3:0> <03:00> RW 



A 16-bit counter of events selected by SELO and 
enabled by CTL0<1:0>. 

A 16-bit counter. 

CounterO Select— refer to Table 21. 

Kill user mode— disables all counters in user 
mode (refer to Table 22). 

14-bit counter 

CTRO counter control: 

00 counter disable, interrupt disable 

01 counter enable, interrupt disable 

10 counter enable, interrupt at count 65536 
(Refer to Section 8.1.23 and Section 8.1.24.) 

11 counter enable, interrupt at count 256 

CTRl counter control: 

00 counter disable,interrupt disable 

01 counter enable, interrupt disable 

10 counter enable, interrupt at count 65536 

11 counter enable, interrupt at count 256 

CTR2 counter control: 

00 counter disable,interrupt disable 

01 counter enable, interrupt disable 

10 counter enable, interrupt at count 16384 

11 counter enable, interrupt at count 256 

Kill PALmode— disables all counters in 
PAL mode (refer to Table 22). 

Kill kernel, executive, supervisor mode- 
disables all counters in kernel, executive, 
and supervisor modes (refer to Table 22). 
Ku=l, Kp=l, and Kk=l enables counters in 
executive and supervisor modes only. 

Counterl Select— refer to Table 21. 

Counter2 Select— refer to Table 21. 
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Table 21 shows the PMCTR counter select options. 



Table 21 PMCTR Counter Select Options 



CounterO 
SELO<0> 



Counterl 
SEL1<3:0> 



Counter2 
SEL2<3:0> 



0:Cycles 



l:lnstructions 



0x0: nonissue cycles 

Valid instruction in S3 but none 

issued. 

0x1: split-issue cycles 

Some, but not all, instructions at 

S3 issued. 

0x2: pipe-dry cycles 

No valid instruction at S3. 

0x3: replay trap 

A replay trap occurred. 

0x4: single-issue cycles 
Exactly one instruction issued. 

0x5: dual -issue cycles 

Exactly two instructions issued. 

0x6: triple-issue cycles 
Exactly three instructions 
issued. 

0x7: quad-issue cycles 

Exactly four instructions issued. 

0x8: jsr-ret if sel2=PC-M 
Instruction issued if sel2 is 
PC-M. 

0x8: cond-branch if sel2=BR-M 
Instruction issued if sel2 is 
BR-M 

0x8: all flow-change instructions 
if sel2=! (PC-M or BR-M) 

0x9: IntOps issued 

OxA: FPOps issued 

OxB: loads issued 

OxC: stores issued 

OxD: I cache issued 



0x0: long(>15 cycle) stalls 



0x1: reserved 



0x2: PC-mispredicts 



0x3: BR-mispredicts 



0x4: 
0x5 
0x6 
0x7 
0x8 



Icache/RFB misses 
ITB misses 
Dcache LD misses 
DTB misses 
LDs merged in MAP 

(continued on next page) 



Preliminary — Subject to Ciiange — Juiy 1996 89 



Table 21 (Cont.) PMCTR Counter Select Options 



CounterO 
SELO<0> 



Counterl 
SEL1<3:0> 



Counter2 
SEL2<3:0> 



OxE : Dcache accesses 



OxF: pick CBOX input 1 



0x9: LDU replay traps 

OxA:WB/MAF full replay traps 

OxB: external perf_mon_h 
input. This counts in CPU 
cycles, but input is sampled 
in sysclk cycles. The external 
status perf_mon_h is sampled 
once per system clock and held 
through the system clock period. 
This means that "sysdock ratio" 
counts occur for each system 
clock cycle in which the status is 
true. 

OxC: CPU cycles 

OxD: MB stall cycles 

OxE: LDxL instructions issued 

OxF: pick CBOX input 2 
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Table 22 Measurement Mode Control 



Kill Bit Settings 
Measurement Mode Desired Ku Kp Kk 



Program 











PAL only 


1 





1 


OS only (kernel, executive, 
supervisor) 


1 


1 





User only 





1 


1 


All except PAL 





1 





OS + PAL (not user) 


1 








User +PAL (not kernel, 
executive, and supervisor) 








1 


Executive and supervisor only^ 


1 


1 


1 



■■^In this instance, Kk means kill kernel only. The combination Ku=l, Kp=l, and Kk=l is used to 
gather events for the executive and supervisor modes only. 



Note 



Both the user and the operating system can make PAL subroutine 
calls that put the machine in PALmode. The "OS only," "user only," 
and "executive and supervisor only" modes do not measure the events 
during the PAL subroutine calls made by the OS or user. The "OS + 
PAL" and "user + PAL" modes should be used carefully. "OS + PAL" 
mode measures the events during the PAL calls made by the user, 
whereas "user + PAL" mode measures the events during the PAL calls 
made by the OS. 
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8.2 Memory Address Translation Unit (Mbox) IPRs 

The Mbox internal processor registers (IPRs) are described in Section 8.2.1 
tl^rougl^ Section 8.2.23. 

8.2.1 Dstream Translation Buffer Address Space Number (DTBASN) Register 

DTB_ASN is a write-only register tinat must be written witin an exact duplicate 
of tl-ielTB_ASN register AS N field. Figure 39 shows the DTB_ASN register 
format. 

Figure 39 Dstream Translation Buffer Address Space Number (DTBASN) 
Register 



IGN 


63 




57 56 






32 


ASN<6:0> 


IGN 



92 Preliminary— Subject to Cliange—Juiy 1996 



8.2.2 Dstream Translation Buffer Current Mode (DTB_CM) Register 

DTB_CM is a write-only register tinat must be written witin an exac± duplicate 
of tine I box current mode (I CM) register CM field. TInese bits indicate tine 
current mode of the machine, as described in the Alpha Architecture Rdterence 
Manual. Figure 40 shows the DTB_CM register format. 

Figure 40 Dstream Translation Buffer Current Mode (DTBCM) Register 
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8.2.3 Dstream Translation Buffer Tag (DTB_TAG) Register 

DTB_TAG is a write-only register tinat writes tine DTB tag and the contents 
of tine DTB_PTE register to tine DTB. To ensure tine integrity of the DTBs, 
the DTB's PTE array is updated simultaneously from the internal DTB_PTE 
register when the DTB_TAG register is written. 

The entry to be written is chosen at the time of the DTB_TAG write operation 
by a not-last-used replacement algorithm implemented in hardware. A write 
operation to the DTB_TAG register increments the translation buffer (TB) 
entry pointer of the DTB, which allows writing the entire set of DTB PTE and 
TAG entries. TheTB entry pointer is initialized to entry zero and theTB valid 
bits are cleared on chip reset but not on timeout reset. Figure 41 shows the 
DTB_TAG register format. 

Figure 41 Dstream Translation Buffer Tag (DTBTAG) Register 
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8.2.4 Dstream Translation Buffer Page Table Entry (DTB_PTE) Register 

DTB_PTE is a read/write register representing the 64-entry DTB page 
table entries (PTEs). Tine entry to be written is cinosen by a not-last-used 
replacement algoritl^m implemented in Inardware. Write operations to 
DTB_PTE use tine memory format bit positions, as described in the Alpha 
Architecture Reference M anual , with tine exception that some fields are ignored. 
In particular, the page frame number (PFN) valid bit is not stored in the DTB. 

To ensure the integrity of the DTB, the PTE is actually written to a temporary 
register and is not transferred to the DTB until the DTB_TAG register is 
written. As a result, writing the DTB_PTE and then reading without an 
intervening DTB_TAG write operation does not return the data previously 
written to the DTB_PTE register. 

Read operations of the DTB_PTE require two instructions. First, a read 
from the DTB_PTE sends the PTE data to the DTB_PTE_TEMP register. A 
zero value is returned to the integer register file (IRF) on a DTB_PTE read 
operation. A second instruction reading from the DTB_PTE_TEMP register 
returns the PTE entry to the register file. Reading the DTB_PTE register 
increments the TB entry pointer of the DTB, which allows reading the entire 
set of DTB PTE entries. Figure 42 shows the DTB_PTE register format. 

Note 



The Alpha Architecture Reference Manual provides descriptions of the 
fields of the PTE. 
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Figure 42 Dstream Translation Buffer Page Table Entry (DTBPTE) Register — Write 
Format 
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8.2.5 Dstream Translation Buffer Page Table Entry Temporary (DTB_PTE_TEMP) 
Register 

DTB_PTE_TEMP is a read-only holding register used for DTB_PTE data. Read 
operations of the DTB_PTE require two instructions to return the PIE data to 
the register file. The first reads the DTB_PTE register to the DTB_PTE_TE MP 
register and returns zero to the register file. The second returns the DTB_ 
PTE_TEMP register to the integer register file (I RF). Figure 43 shows the 
DTB_PTE_TEMP register format. 

Figure 43 Dstream Translation Buffer Page Table Entry Temporary (DTBPTETEMP) 
Register 
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8.2.6 Dstream Memory Management Fault Status (MMSTAT) Register 

MM_STAT is a read-only register that stores information on Dstream faults 
and Dcache parity errors. The VA, VA_FORM, and MM_STAT registers are 
locked against further updates until software reads the VA register. The MM_ 
STAT bits are only modified by hardware when the register is not locked and 
a memory management error, DTB miss, or Dcache parity error occurs. The 
M M_STAT register is not unlocked or cleared on reset. Figure 44 and Table 23 
describe the MM_STAT register format. 

Figure 44 Dstream Memory Management Fault Status (MMSTAT) Register 
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Table 23 Dstream Memory Management Fault Status Register Fields 



Name 



Extent 



Type 



Description 



WR 


<00> 


RO 


ACV 


<01> 


RO 


FOR 


<02> 


RO 


FOW 


<03> 


RO 


DTB_MISS 


<04> 


RO 


BAD VA 


<05> 


RO 



Set if reference that caused error was a write 
operation. 

Set if reference caused an access violation. 
Includes bad virtual address. 

Set if reference was a read operation and the 
PTE FOR bit was set. 

Set if reference was a write operation and the 
PTE FOW bit was set. 

Set if reference resulted in a DTB miss. 

Set if reference had a bad virtual address. 

(continued on next page) 
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Table 23 (Cont.) Dstream Memory Management Fault Status Register Fields 
Name Extent Type Description 

RA <10:06> RO RA field of the faulting instruction. 

OPCODE <16:11> RO Opcode field of the faulting instruction. 
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8.2.7 Faulting Virtual Address (VA) Register 

VA is a read-only register. When Dstream faults, DTB misses, or Dcache 
parity errors occur, tine effective virtual address associated with the fault, miss, 
or error is latched in the VA register. The VA, VA_FORM, and MM_STAT 
registers are locked against further updates until software reads the VA 
register. The VA register is not unlocked on reset. Figure 45 shows the VA 
register format. 

Figure 45 Faulting Virtual Address (VA) Register 
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8.2.8 Formatted Virtual Address (VA_FORM) Register 

VA_FORM is a read-only register containing tine virtual page table entry (PTE) 
address calculated as a function of tine faulting virtual address and tine virtual 
page table base (VA and MVPTBR registers). Tiiis is done as a performance 
enliancement to tine Dstream TBmiss PAL flow. 

Tine virtual address is formatted as a 32-bit PTE when the NT_Mode bit 
(MCSR<01>) is set (see Figure 46). VA_FORM is locked on any Dstream fault, 
DTB miss, or Dcache parity error. The VA, VA_FORM, and MM_STAT registers 
are locked against further updates until software reads the VA register. The 
VA_FORM register is not unlocked on reset. Figure 47 shows the VA_FORM 
register format when MCSR<01>is clear. 

Figure 46 Formatted Virtual Address (VA_FORI\/l) Register (NT_l\/lode=1) 
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Figure 47 Formatted Virtual Address (VA_FORM) Register (NT_Mode=0) 
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Table 24 describes the VA_FORM register fields. 

Table 24 Formatted Virtual Address Register Fields 

Name Extent Type Description 

NT_Mode=0 

VPTB <53:33> RO Virtual page table base address as stored in 

MVPTBR 

VA<42:13> <32:03> RO Subset of the original faulting virtual address 

NT_Mode=1 

VPTB <53:30> RO Virtual page table base address as stored in 

MVPTBR 

VA<31:13> <21:03> RO Subset of the original faulting virtual address 
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8.2.9 Mbox Virtual Page Table Base Register (MVPTBR) 

M VPTBR is a write-only register containing tine virtual address of the base of 
the page table structure. It is stored in the Mbox to be used in calculating the 
VA_FORM value for the DstreamTBmiss PAL flow. UnliketheVA register, the 
MVPTBR is not locked against further updates when a Dstream fault, DTB 
M iss, or Dcache parity error occurs. Figure 48 shows the MVPTBR format. 

Figure 48 Mbox Virtual Page Table Base Register (l\/IVPTBR) 
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8.2.10 Dcache Parity Error Status (DC_PERR_STAT) Register 

DC_PERR_STAT is a read/write register that locks and stores Dcache parity 
error status. The VA, VA_FORIM, and |V!IV!_STAT registers are locked against 
further updates until software reads the VA register. If a Dcache parity error 
is detected while the Dcache parity error status register is unlocked, the 
error status is loaded into DC_PERR_STAT<05:02>. The LOCK bit is set and 
the register is locked against further updates (except for the SEO bit) until 
software writes a 1 to clear the LOCK bit. 

The SEO bit is set when a Dcache parity error occurs while the Dcache parity 
error status register is locked. Once the SEO bit is set, it is locked against 
further updates until the software writes a 1 to DC_PERR_STAT<00>to 
unlock and clear the bit. The SEO bit is not set when Dcache parity errors are 
detected on both pipes within the same cycle. I n this particular situation, the 
pipeO/pipel Dcache parity error status bits indicate the existence of a second 
parity error. The DC_PERR_STAT register is not unlocked or cleared on reset. 

Figure 49 and Table 25 describe the DC_PERR_STAT register format. 
Figure 49 Dcache Parity Error Status (DC_PERR_STAT) Register 
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Table 25 Dcache Parity Error Status Register Fields 

Name Extent Type Description 

SEO <00> WIC Set if second Dcache parity error occurred in a 

cycle after tlie register was locl<ed. Tlie SEO bit 
is not set as a result of a second parity error that 
occurs within the same cycle as the first. 

LOCK <01> WIC Set if parity error detected in Dcache. Bits 

<05:02>are locked against further updates when 
this bit is set. Bits <05:02> are cleared when the 
LOCK bit is cleared. 

Set on data parity error in Dcache bank 0. 

Set on data parity error in Dcache bank 1. 

Set on tag parity error in Dcache bank 0. 

Set on tag parity error in Dcache bank 1. 



DPO 


<02> 


RO 


DPI 


<03> 


RO 


TPO 


<04> 


RO 


TPl 


<05> 


RO 



Preliminary — Subject to Change — Juiy 1996 105 



8.2.11 Dstream Translation Buffer Invalidate All Process (DTBJAP) Register 

DTB_I AP is a write-only register. Any write operation to tinis register 
invalidates all data translation buffer (DTB) entries in winicin tine address 
space match (ASM ) bit is equal to zero. 

8.2.12 Dstream Translation Buffer Invalidate All (DTBJA) Register 

DTB_I A is a write-only register. Any write operation to this register 
invalidates all 64 DTB entries, and resets the DTB not-last-used (NLU) pointer 
to its initial state. 
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8.2.13 Dstream Translation Buffer Invalidate Single (DTBJS) Register 

DTB_IS is a write-only register. Writing a virtual address to this register 
invalidates the DTB entry that meets either of the following criteria: 

• A DTB entry whose VA field matches DTBJ S<42:13> and whose A5N field 
matches DTB_ASN<63:57>. 

• A DTB entry whose VA field matches DTB_I S<42:13> and whose ASM bit 
is set. 

Figure 50 shows the DTB_IS register format. 

Figure 50 Dstream Translation Buffer Invalidate Single (DTBJS) Register 
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Note 



The DTB_IS register is written before the normal I box trap point. The 
DTB invalidate single operation is aborted by the I box only for the 
following trap conditions: 

• ITB miss 

• PC mispredict 

• When the HW MTPR DTB IS is executed in user mode 
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8.2.14 Mbox Control Register (MCSR) 

MCSR is a read/write register that controls features and records status in the 
M box. This register is cleared on chip reset but not on timeout reset. Figure 51 
and Table 26 describe the MCSR format. 



Figure 51 Mbox Control Register (l\/ICSR) 
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Table 26 Mbox Control Register Fields 



Name 



Extent 



Type 



Description 



M BIG 
ENDIAN 


<00> 


SP<1:0> 


<02:01> 



RW,0 Mbox Big Endian mcx:le enable. When set, bit 2 of tlie 

pliysical address is inverted for all longword Dstream 
references. 

RW,0 2Ue4-266, 2Ue4-300, and 2Ue4-333 

Superpage mode enables. 

Note: Superpage access is only allowed in kernel mode. 

SP<l>enables superpage mapping when VA<42:41> = 2. 
I n this mode, virtual addresses VA<39:13>are mapped 
directly to physical addresses PA<39:13>. Virtual 
address bit VA<40>is ignored in this translation. 

SP<0> enables one-to-one superpage mapping of 
Dstream virtual addresses with VA<42:30> = lFFEi5. 
I n this mode, virtual addresses VA<29:13> are mapped 
directly to physical addresses PA^9:13>, with bits 
<39:30> of physical address set to 0. SP<0> is the 
NT_Mode bit that is used to control virtual address 
formatting on a read operation from the VA_FORM 
register. 

2U64^P1 and 2U64^P2 

SP<0> must always beset. Clearing this bit will cause 
21164-Pn operation to be UNPREDICTABLE. 

RW,0 Reserved to Digital. Must be zero (MBZ). 

RW,0 Ebox Big Endian mode enable. This bit is sent to the 

Ebox to enable Big Endian support for the EXTxx, 
MSKxxand 1 NSxxbyte instructions. This bit causes the 
shift amount to be inverted (one's-complemented) prior 
to the shifter operation. 

RW,0 Reserved to Digital. Must be zero (MBZ). 



Reserved 

E_B1G_ 
ENDIAN 



Reserved 



<03> 
<04> 



<05> 
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8.2.15 Dcache Mode (DC_MODE) Register 

DC_MODE is a read/write register tiiat controls diagnostic and test modes in 
the Dcacine. Tiiis register is cleared on cinip reset but not on timeout reset. 
Figure 52 and Table 27 describe tine DC_MODE register format. 

Note 



The following bit settings are required for normal operation: 

DC_ENA =1 
DC_FHIT =0 
DC_BAD_PARITY =0 
DC PERR DISABLE =0 



Figure 52 Dcache Mode (DC_MODE) Register 
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Table 27 Dcache Mode Register Fields 



Name Extent Type Description 



DC_ENA <00> RW,0 Software Dcache enable. The DC_EN A bit 

enables the Dcache unless the Dcache has 
been disabled in hardware (DC_DOA is set). 
(The Dcache is enabled if DC_ENA=1 and 
DC_DOA=0). When clear, the Dcache command 
is not updated by ST or FILL operations, 
and all LD operations are forced to miss in 
the Dcache. Must be one (MBO) in normal 
operation. 

DC_FHIT <01> RW,0 Dcache force hit. When set, the DC_F HIT 

bit forces all Dstream references to hit in the 
Dcache. Must be zero in normal operation. 

DC_BAD_ <02> RW,0 When set, the DC_BAD_PARITY bit inverts 

PARITY the data parity inputs to the Dcache on integer 

stores. This has the effect of putting bad data 
parity into the Dcache on integer stores that 
hit in the Dcache. This bit has no effect on 
the tag parity written to the Dcache during 
FILL operations, or the data parity written 
to the Cbox write data buffer on integer store 
instructions. 

Floating-point store instructions should not be 
issued when this bit is set because it may result 
in bad parity being written to the Cbox write 
data buffer. Must be zero (MBZ) in normal 
operation. 

DC_PERR_ <03> RW,0 When set, the DC_PERR_DISABLE bit disables 

DISABLE Dcache parity error reporting. When clear, 

this bit enables all Dcache tag and data parity 
errors. Parity error reporting is enabled during 
all other Dcache test modes unless this bit is 
explicitly set. Must be zero (MBZ) in normal 
operation. 
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8.2.16 Miss Address File Mode (MAF_MODE) Register 

MAF_MODE is a read/write register that controls diagnostic and test modes 
in tine IMbox miss address file (MAF). This register is cleared on chip reset. 
M AF_MODE<05> is also cleared on timeout reset. Figure 53 and Table 28 
describe the MAF_MODE register format. 

Note 



The following bit settings are required for normal operation: 

DREAD_NOMERGE =0 
WB_FLUSH_ALWAYS =0 
WB_NOMERGE =0 
MAF_ARB_DISABLE =0 
WB CNT DISABLE =0 



Figure 53 Miss Address File Mode (MAFMODE) Register 
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DREAD_NOMERGE 
WB_FLUSH_ALWAYS 
WB_NOMERGE 
IO_NMERGE 
WB_CNT_DISABLE 
MAF_ARB_DISABLE 
DREAD_PENDING (Read-Only) 
WB_PENDING (Read-only) 
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RAZ/IGN 
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Table 28 Miss Address File Mode Register Fields 



Name 



Extent 



Type 



Description 



10 N MERGE <03> RW,0 



DREAD_ <00> RW,0 Miss address file (MAP) DREAD Merge Disable. When 

NOMERGE set, this bit disables all merging in the DREAD portion 

of the MAP. Any load instruction that is issued when 
DREAD_NOMERGE is set is forced to allocate a new 
entry. Subsequent merging to that entry is not allowed 
(even if DREAD_NOMERGE is cleared). Must be zero 
(MBZ) in normal operation. 

WB_PLUSH_ <01> RW,0 When set, this bit forces the write buffer to flush 

ALWAYS whenever there is a valid WB entry. Must be zero 

(MBZ) in normal operation. 

WB_ <02> RW,0 When set, this bit disables all merging in the write 

NOMERGE buffer. Any store instruction that is issued when WB_ 

NOMERGE is set is forced to allocate a new entry. 

Subsequent merging to that entry is not allowed (even 

ifWB_NOMERGE is cleared). Must be zero (MBZ) in 

normal operation. 

When set, this bit prevents loads from I/O space 
(address bit <39>=1) from merging in the MAP. Should 
be zero (SBZ) in typical operation. 

When set, this bit disables the 64-cycle WB counter in 
the MAP arbiter. The top entry of the WB arbitrates 
at low priority only when a LDx_L instruction is issued 
or a second WB entry is made. M ust be zero (M BZ) in 
normal operation. 

When set, this bit disables all DREAD and WB requests 
in the MAP arbiter. WB_Reissue, Replay, Iref and MB 
requests are not blocked from arbitrating for the Scache. 
This bit is cleared on both timeout and chip reset. Must 
be zero (MBZ) in normal operation. 

<06> R,0 Indicates the status of the MAP DREAD file. When set, 

there are one or more outstanding DREAD requests 
in the MAP file. When clear, there are no outstanding 
DREAD requests. 

<07> R,0 This bit indicates the status of the MAP WB file. When 

set, there are one or more outstanding WB requests in 
the MAP file. When clear, there are no outstanding WB 
requests. 



WB_CNT_ 
DISABLE 



MAP_ARB 
DISABLE " 



DREAD_ 
PENDING 



WB_ 
PENDING 



<04> RW,0 



<05> RW,0 
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8.2.17 Dcache Flush (DC_FLUSH) Register 

DC_FLUSH is a write-only register. A write operation to this register clears 
all the valid bits in both banks of the Dcache. 

8.2.18 Alternate Mode (ALT_MODE) Register 

ALT_MODE is a write-only register that specifies the alternate processor 
mode used by some HW_LD and HW_ST instructions. Figure 54 and Table 29 
describethe ALT_MODE register format. 

Figure 54 Alternate Mode (ALTMODE) Register 
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Table 29 Alternate Mode Register Settings 



ALT MODE<04:03> Mode 



00 


Kernel 


1 


Executive 


10 


Super visa- 


1 1 


User 
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8.2.19 Cycle Counter (CC) Register 

CC is a read/write register. The 21164 supports it as described in the Alpha 
Architecture Rdterence M anual . Tine low Inalf of tine counter, when enabled, 
increments once each CPU cycle. The upper half of the CC register is the 
counter offset. An HW_MTPR instruction writes CC<63:32>. Bits <31:00>are 
unchanged. CC_CTL<32> is used to enable or disable the cycle counter. The 
CC<31:00> is written toCC_CTL by an HW_MTPR instruction. 

The CC register is read by the RPCC instruction as defined in the Alpha 
Architecture Rdterence Manual . The RPCC instruction returns a 64-bit 
value. The cycle counter is enabled to increment only three cycles after the 
MTPR CC_CTL (with CC_CTL<32>set) instruction is issued. This means 
that an RPCC instruction issued four cycles after an HW_MTPR CC_CTL 
instruction that enables the counter reads a value that is one greater than the 
initial count. 

The CC register is disabled on chip reset. Figure 55 shows the CC register 
format. 

Figure 55 Cycle Counter (CC) Register 
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8.2.20 Cycle Counter Control (CC_CTL) Register 

CC_CTL is a write-only register that writes the low 32 bits of the cycle counter 
to enable or disable the counter. Bits CC<31:04> are written with the value 
in CC_CTL<31:04>on a HW_MTPR instruction totheCC_CTL register. 
Bits CC<03:00> are written with zero. Bits CC<63:32>are not changed. If 
CC_CTL<32> is set, then the counter is enabled; otherwise, the counter is 
disabled. Figure 56 and Table 30 describe the CC_CTL register format. 

Figure 56 Cycle Counter Control (CC_CTL) Register 
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Table 30 Cycle Counter Control Register Fields 



Name 



Extent 



Type 



Description 



COUNT<31:04> <31:04> WO 
CC ENA <32> WO 



Cycle count. This value is loaded into 
CC<31:04>. 

Cycle Counter enable. When set, this bit 
enables the CC register to begin incrementing 
3 cycles later. An RPCC issued 4 cycles after 
CC_CTL<32> is written "sees" the initial count 
incremented by 1. 
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8.2.21 Dcache Test Tag Control (DC_TEST_CTL) Register 

DC_TEST_CTL is a read/write register used exclusively for testing and 
diagnostics. An address written totinis register is used to index into tine Dcacine 
array winen reading or writing to tine DC_TEST_TAG register. Figure 57 and 
Table 31 describe tine DC_TEST_CTL register format. Section 8.2.22 describes 
how tinis register is used. 

Figure 57 Dcache Test Tag Control (DC_TEST_CTL) Register 
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Table 31 Dcache Test Tag Control Register Fields 



Name 



Extent 



Type 



Description 



BANKO 



BANKl 



<00> RW 



<01> RW 



INDEX<12:3> <12:03> RW 



Dcache BankO enable. When set, reads from 
DC_TEST_TAG return the tag from Dcache 
bankO, writes to DC_TEST_TAG write to Dcache 
bankO. When clear, reads from DC_TEST_TAG 
return the tag from Dcache bankl. 

Dcache Bankl enable. When set, writes to 
DC_TEST_TAG write to Dcache bankl. This 
bit has no effect on reads. 

Dcache tag index. This field is used on reads 
from and writes to the DC_TEST_TAG register to 
index into the Dcache tag array. 
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8.2.22 Dcache Test Tag (DC_TEST_TAG) Register 

DC_TE5T_TAG is a read/write register used exclusively for testing and 
diagnostics. When DC_TEST_TAG is read, the value in the DC_TEST_CTL 
register is used to index into the Dcache. The value in the tag, tag parity, valid 
and data parity bits for that index are read out of the Dcache and loaded into 
the DC_TEST_TAG_TEMP register. A zero value is returned to the integer 
register file (IRF). If BANKO is set, the read operation is from Dcache bankO. 
Otherwise, the read operation is from Dcache bankl. 

When DC_TEST_TAG is written, the value written to DC_TEST_TAG is 
written to the Dcache index referenced by the value in the DC_TEST_CTL 
register. The tag, tag parity, and valid bits are affected by this write operation. 
Data parity bits are not affected by this write operation (use DC_MODE<02> 
and force hit modes). If BANKO is set, the write operation is to Dcache bankO. 
If BANKl is set, the write operation is to Dcache bankl. If both are set, both 
banks are written. 

Figure 58 and Table 32 describe the DC_TEST_TAG register format. 
Figure 58 Dcache Test Tag (DC_TEST_TAG) Register 
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Table 32 Dcache Test Tag Register Fields 



Name 



Extent 



Type 



Description 



TAG PARITY <02> WO 



OWO VALID <11> WO 



OWl VALID <12> WO 



TAG<38:13> <38:13> WO 



Tag parity. This bit refa"s to tlie Dcaclie tag 
parity bit tliat covers tag bits 38 tlirougli 13 
(valid bits not covered). 

Octaword valid bit 0. This bit refers to the 
Dcache valid bit for the low-order octaword 
within a Dcache 32-byte block. 

Octaword valid bit 1. This bit refers to the 
Dcache valid bit for the high-order octaword 
within a Dcache 32-byte block. 

TAG<38:13>. These bits refer to the tag field in 
the Dcache array. 

Note: Bit 39 is not stored in the array. 
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8.2.23 Dcache Test Tag Temporary (DC_TEST_TAG_TEMP) Register 

DC_TEST_TAG_TEMP is a read-only register used exclusively for testing and 
diagnostics. 

Reading the Dcache tag array requires a two-step read process: 

1. The first read operation from DC_TE ST_TAG reads the tag array and data 
parity bits and loads them into the DC_TEST_TAG_TEMP register. An 
UNDEFINED value is returned to the integer register file (IRF). 

2. The second read operation of the DC_TEST_TAG_TEMP register returns 
the Dcache test data to the integer register file (IRF). 

Figure 59 and Table 33 describe the DC_TE5T_TAG_TEMP register format. 
Figure 59 Dcache Test Tag Temporary (DC_TEST_TAG_TEMP) Register 
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Table 33 Dcache Test Tag Temporary Register Fields 



Name 



Extent 



Type 



Description 



TAG PARITY <02> RO 



DATA PARO<0> <03> RO 



DATA PAR0<1> <04> RO 



DATA PAR1<0> <05> RO 



DATA PAR1<1> <06> RO 



OWO VALID <11> RO 



OWl VALID <12> RO 



TAG<38:13> <38:13> RO 



Tag parity. Tills bit refers to tiie Dcaciie tag parity 
bit tiiat covers tag bits 38 tiirougii 13 (valid bits not 
covered). 

Data parity. Tills bit refers to tiie BankO Dcaciie 
data parity bit tiiat covers tiie lower longword of data 
Indexed by DC_TEST_CTL<12:03>. 

Data parity. Tills bit refers to tiie Banl<0 Dcachie data 
parity bit tiiat covers tiie upper longword of data 
Indexed by DC_TEST_CTL<12:03>. 

Data parity. Tills bit refers to tiie Bani<l Dcaciie 
data parity bit tiiat covers tiie lower longword of data 
Indexed by DC_TEST_CTL<12:03>. 

Data parity. Tills bit refers to tiie Banl<l Dcaciie data 
parity bit tiiat covers tiie upper longword of data 
Indexed by DC_TEST_CTL<12:03>. 

Octaword valid bit 0. Tills bit refers to tiie Dcaciie valid 
bit for tiie low-order octaword witiiln a Dcaciie 32-byte 
block. 

Octaword valid bit 1. Tills bit refers to tiie Dcaciie valid 
bit for tiie iilgii-order octaword witiiln a Dcaciie 32-byte 
block. 

TAG<38:13>. Tiiese bits refer to tiie tag field In tiie 
Dcaciie array. 

Note: Bit 39 Is not stored In tiie array. 
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8.3 External Interface Control (Cbox) IPRs 

Table 34 lists specific I PRs for controlling Scache, Bcache, system configuration, 
and logging error information. These I PRs cannot be read or written from the 
system. They are placed in the 1MB region of 21164-specific I/O address space 
ranging from FF FFFOOOOOtoFF FFFF FFFF. Any read or write operation to 
an undefined I PR in this address space produces UNDEFINED behavior. The 
operating system should not map any address in this region as writable in any 
mode. 

The Cbox internal processor registers are described in Section 8.3.1 through 
Section 8.3.9. 

Table 34 Cbox Internal Processor Register Descriptions 



Register 


Address 


Type^ 


Description 


SC_CTL 




FFF0 00A8 


RW 


Controls Scache behavior. 


SC_STAT 




FFF0 00E8 


R 


Logs Scache-rdated errors. 


SC_ADDR 




FFF0 0188 


R 


Contains the address for Scache- 
related errors. 


BC_CONTROL 




FFF0 0128 


W 


Controls Bcache/system interface 
and Bcache testing. 


BC_CONFIG 




FFF0 01C8 


W 


Contains Bcache configuration 
parameters. 


BC_TAG_ADDR 




FFF0 0108 


R 


Contains tag and control bits for 
FILLS from Bcache. 


EI_STAT 




FFF0 0168 


R 


Logs Bcache/system-related errors. 


EI_ADDR 




FFF0 0148 


R 


Contains the address for 
Bcache/system-related errors. 


FILL_SYN 


FF 


FFF0 0068 


R 


Contains fill syndrome or parity 
bits for Fl LLs from Bcache or main 
memory. 


iBC_CONTROL<01>must be when readi 


ng any IPR 


in this table. 
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8.3.1 Scache Control (SC_CTL) Register (FF FFFO 00A8) 

SC_CTL is a read/write register that controls Scache activity. Figure 60 and 
Table 35 describe the SC_CTL register format. The bits in this register are 
initialized to the value indicated in Table 35 on reset, but not on timeout 
reset. 

Figure 60 Scache Control (SC_CTL) Register 
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Table 35 Scache Control Register Fields 



Field 



Extent 



Type 



Description 



SC FHIT 



<00> 



RW,0 



SC_FLUSH 

SC_TAG_ 
STAT<5:0> 



<01> RW,0 

<07:02> RW,0 



When set, this bit forces cacheable load 
and store instructions to hit in the 
Scache, irrespective of the tag status bits. 
Noncacheable references are not forced to 
hit in the Scache and will be driven offchip. 
I n this mode, only one Scache set may be 
enabled. The Scache tag and data parity 
checking are disabled. 

For store instructions, the value of the 
tag status and parity bits are specified by 
the SC_TAG_STAT<5:0> field. The tag is 
written with the address provided to the 
Scache with the store instruction. 

All the Scache tag valid bits are cleared 
every time this bit field is written to 1. 

This field is used only in the SC_FHIT mode 
to write any combination of tag status and 
parity bits in the Scache. The parity bit can 
be used to write bad tag parity. The correct 
value of tag parity is even. 

The following bits must be zero for normal 
operation: 

Scache Tag 

Status<5:0> Description 



SC_TAG_ 
STAT<5:2> 



SC_TAG_ 
STAT<1:0> 



Tag parity, valid, 
shared, dirty; 
bits 7, 6, 5, and 4 
respectively 

Octaword modified 
bits 



(continued on next page) 
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Table 35 (Cont.) Scache Control Register Fields 



Field 



Extent 



Type 



Description 



SC FB DP<3:0> <11:08> RW,0 



SC BLK SIZE 



<12> 



RW,1 



SC SET EN<2:0> <15:13> RW,7 



Reserved 



<18:16> RW,0 



Force bad parity— This field is used to write 
bad data parity for tiie selec±ed longwords 
witiiin tiie octaword wiien writing the 
Scache. I f any one of these bits is set to one, 
then the corresponding longword's computed 
parity value is inverted when writing the 
Scache. 

For Scache write transactions, the 
Cbox allocates two consecutive cycles to 
write up to two octawords based on the 
longword valid bits received from the Mbox. 
Therefore, the same longword parity control 
bits are used for writing both octawords. 
For example, SC_FB_DP<0> corresponds to 
LWO and LW4. This bit field must be zero 
during normal operation. 

This bit selects the Scache and Bcache block 
size to be either 64 bytes or 32 bytes. The 
Scache and Bcache always have identical 
block sizes. All the Bcache and main 
memory FILLs or write transactions are 
of the selected block size. At power-up time, 
this bit is set and the default block size 
is 64 bytes. When clear, the block size is 
32 bytes. This bit must be set to the desired 
value to reflect the correct Scache/Bcache 
block size before the 21164 does the first 
cacheable read or write transaction from 
Bcache or system. 

This field is used to enable the Scache sets. 
Only one or all three sets may be enabled 
at a time. Enabling any combination of two 
sets at a time results in UNPREDICTABLE 
behavior. One of the Scache sets must 
always be enabled irrespective of the 
Bcache. 

Reserved to Digital. Must be zero (MBZ). 
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8.3.2 Scache Status (SC_STAT) Register (FF FFFO 00E8) 

SC_5TAT is a read-only register. It is not cleared or unlocked by reset. Any 
PALcode read of this register unlocks SC_ADDR and SC_5TAT and clears 
SC_STAT. 

If an Scache tag or data parity error is detected during an Scache lookup, 
the SC_STAT register is locked against further updates from subsequent 
transactions. Figure 61 and Table 36 describe the SC_STAT register format. 

Figure 61 Scache Status (SC_STAT) Register 
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Table 36 Scache Status Register Fields 



Field Extent Type Description 



SC_TPERR<2:0> <02:00> RO When set, these bits indicate that an 

Scache tag lool<up resulted in a tag parity 
error and identify the set that had the tag 
parity error. 

SC_DPERR<7:0> <10:03> RO When set, these bits indicate that an 

Scache read transaction resulted in a data 
parity error and indicate which longword 
within the two octawords had the data 
parity error. These bits are loaded if any 
longword within two octawords read from 
the Scache during lookup had a data parity 
error. If SC_FHIT (SC_CTL<00>) is set, 
this field is used for loading the longword 
parity bits read out from the Scache. 

SC_CMD<4:0> <15:11> RO This field indicates the Scache transaction 

that resulted in a Scache tag or data 
parity error. This field is written at the 
time the actual Scache error bit is written. 
The Scache transaction may be DREAD, 
I READ, or WRITE command from the 
Mbox, Scache victim command, or the 
system command being serviced. Refer to 
Table 37 for field encoding. 

SC_SCND_ERR <16> RO When set, this bit indicates that an Scache 

transaction resulted in a parity error 
while the SC_TPERR or SC_DPERR 
bit was already set from the earlier 
transaction. This bit is not set for two 
errors in different octawords of the same 
transaction. 
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Table 37 SC_CMD Field Descriptions 



SC CMD<4:3> Source 



SC_CMD<2:0> Encoding 



Description 



Ix 



00 
01 



110 

101 

100 

001 
001 
001 

oil 



Set shared from 
system 

Read dirty from 
system 

Invalidate from 
system 

Scaclie victim 

ScaciielREAD 

Scachie DREAD 

Scaclie DWRITE 
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8.3.3 Scache Address (SC_ADDR) Register (FF FFFO 0188) 

SC_ADDR is a read-only register. It is not cleared or unlocked by reset. The 
address is loaded into this register every time the Scache is accessed if one 
of the error bits in the SC_STAT register is not set. If an Scache tag or data 
parity error is detected, then this register is locked preventing further updates. 
This register is unlocked whenever SC_STAT is read. 

For Scache read transactions, address bits <39:04>are valid to identify the 
address being driven to the Scache. Address bit <04> identifies which octaword 
was accessed first. For each Scache lookup, there is one tag access and two 
data access cycles. If there is a hit, two octawords are read out in consecutive 
CPU cycles. Tag parity error is detected only while reading the first octaword. 
However, data parity error can be detected on either of the two octawords. 
SC_ADDR<39> is always zero. 

If SC_CTL<00> is set (force hit mode), SC_ADDR is used for storing the 
Scache tag and status bits. For each tag in the Scache, there are unique valid, 
shared, and dirty bits for a 32-byte subblock, and modify bits for each octaword 
(16 bytes). There is a single tag and a parity bit for two consecutive 32-byte 
subblocks. In force hit mode, only reads and probes load tag and status into 
the SC_ADDR register. In this mode, tag and data parity checking are disabled 
and the SC_ADDR and SC_STAT registers are not locked on an error. 

I n force hit mode, to write the Scache and read back the same block and 
corresponding tag status bits, a minimum of 5-cycle spacing is required 
between the Scache write and read of the SC_ADDR or SC_STAT. 

Figure 62 and Table 38 describe the SC_ADDR register format. 
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Figure 62 Scache Address (SC_ADDR) Register 
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Table 38 Scache Address Register Fields 



Name 


Extent 


Type 


Description 


Normal Mode 


SC_ADDR<38:04> 


<38:04> 


RO 


Scache address. 


Force Hit Mode 


TP 


<04> 


RO 


Scache tag parity bit. 


VO 


<05> 


RO 


Subblocl<0 tag valid bit. 


SO 


<06> 


RO 


Subblocl<0 tag shared bit. 


DO 


<07> 


RO 


Subblocl<0 tag dirty bit. 


VI 


<08> 


RO 


Subblocl<l tag valid bit. 


SI 


<09> 


RO 


Subblockl tag shared bit. 


Dl 


<10> 


RO 


Subblockl tag dirty bit. 


MO 


<12,11> 


RO 


Octawords modified for subblockO. 


Ml 


<14,13> 


RO 


Octawords modified for subblockl. 


TAG<38:15> 


<38:15> 


RO 


Scache tag. 
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8.3.4 Bcache Control (BC_CONTROL) Register (FF FFFO 0128) 

BC_CONTROL is a write-only register. It is used to enable and control the 
external Bcache. Figure 63 and Table 39 describe the BC_CONTROL register 
format. 

Figure 63 Bcache Control (BC_CONTROL) Register 
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Table 39 Bcache Control Register Fields 



Field Extent Type Description 



BC_ENABLED^ <00> WO,0 When set, the external Bcache is enabled. When 

clear, the Bcache is disabled. When the Bcache is 
disabled, the BID does not perform external cache 
read or write transactions. 

ALLOC_CYC <01> WO,0 When set, the issue unit does not allocate a 

cycle for noncacheable fill data. When clear, the 
instruction issue unit allocates a cycle for returning 
noncacheable fill data to be written to the Dcache. I n 
either case, a cycle is always allocated for cacheable 
integer fill data. If this bit is clear, the latency for 
all noncacheable read operations increases by 1 CPU 
cycle. 

Note: This bit must be clear before reading any 
Cbox I PR. It can be set when reading all other I PRs 
and noncacheable LDs. 

EI_CMD_GRP2 <02> WO,0 When set, the optional commands, LOCK and 

SET Dl RTY are driven to the 21164 external 
interface command pins to be acknowledged by 
the system interface. When clear, the SET Dl RTY 
command is not driven to the command pins. It is 
UNPREDICTABLE if the LOCK command is driven 
to the pins. However, the system should never 
CACK the LOCK command if this bit is clear. 

EI_CMD_GRP3 <03> WO,0 When set, the M B command is driven to the 21164 

external interface command pins to be acknowledged 
by the system interface. When clear, the MB 
command is not driven to the command pins. 

CORR_FILL_DAT <04> W0,1 Correct fill data from Bcache or main memory, in 

ECC mode. When set, fill data from Bcache or main 
memory first goes through error correction logic 
before being driven to the Scache or Dcache. If the 
error is correctable, it is transparent to the system. 

When clear, fill data from Bcache or main memory is 
driven directly to the Dcache before an ECC error is 
detected. If the error is correctable, corrected data is 
returned again, Dcache is invalidated, and an error 
trap is taken. 

This bit should be clear during normal operation. 

^When dear, the read speed (BC_RD_SPD<3:0>) and the write speed (BC_WR_SPD<3:0>) must be equal to 
the sysdk to CPU clock ratio. 

(continued on next page) 
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Table 39 (Cont.) Bcache Control Register Fields 



Field 



Extent Type 



Description 



VTM FIRST 



<05> W0,1 



EI_ECC_OR 
PARITY 



BC FHIT 



<06> W0,1 



<07> WO,0 



BC_TAG_ 
STAT<4:0> 



<12:08> WO 



This bit is set for systems witliout a victim buffer. 
On a Bcaciie miss, tlie 21164 first drives out tlie 
victimized blocl<'s address on tlie system address bus, 
followed by the read miss address and command. 
This bit is cleared for systems with a victim buffer. 
On a Bcache miss with victim, the 21164 first drives 
out the read miss followed by the victim address and 
command. 

When set, the 21164 generates or expects quadword 
ECC on the data check pins. When clear, the 21164 
generates or expects even-byte parity on the data 
check pins. 

Bcache force hit. When set, and the Bcache is 
enabled, all references in cached space are forced 
to hit in the Bcache. A FILL to the Scache is 
forced to be private. Software should turn off 
BC_CONTROL<02>to allow clean to private 
transitions without going to the system. 

For write transactions, the values of tag status and 
parity bits are specified by the BC_TAG_STAT field. 
Bcache tag and index are the address received by 
the BlU. The Bcache tag RAMs are written with the 
address minus the Bcache index. This bit must be 
zero during normal operation. 

This bitfield is used only in BC_FHIT=1 mode to 
write any combination of tag status and parity bits 
in the Bcache. The parity bit can be used to write 
bad tag parity. These bits are UNDEFINED on 
reset. This bit field must be zero during normal 
operation. The field encoding is as follows: 



(continued on next page) 
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Table 39 (Cont.) Bcache Control Register Fields 



Field 



Extent 



Type 



Description 



Bcache Tag Status 
Bit 



Description 



BC_TAG_STAT<4> 
BC_TAG_STAT<3> 

BC_TAG_STAT<2> 
BC_TAG_STAT<1> 
BC TAG STAT<0> 



Parity for Bcache tag 

Parity for Bcaciie tag status 
bits 

Bcaclietag valid bit 

Bcaclietag sliared bit 

Bcaclietag dirty bit 



BC BAD DAT 



<14:13> WO,0 



El DIS ERR 



<15> 



W0,1 



PIPE LATCH 



<16> 



WO,0 



Wiien set, bits in tiiis field can be used to write 
bad data with correctable or uncorrectable errors 
in ECC mode. When bit <13>is set, data bit <0> 
and <64>are inverted. When bit <14>is set, data 
bit <l>and <65> are inverted. When the same 
octaword is read from the Bcache, the 21164 detects 
a correctable/uncorrectable ECC error on both the 
quadwords based on the value of bits <14:13> used 
when writing. This bit field must be zero during 
normal operation. 

When set, this bit causes the 21164 to ignore 
any ECC (parity) error on fill data received from 
the Bcache or main memory; or Bcache tag or 
control parity error. It also ignores a system 
command/address parity error. No machine check is 
taken when this bit is set. 

When set, this bit causes the 21164 to pipe the 
system control pins (addr_bus_req_h, cack_h, and 
dack_h) for one system clock. Refer to Section 11 
for timing details. 



(continued on next page) 
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Table 39 (Cont.) Bcache Control Register Fields 



Field 



Extent 



Type 



Description 



BC WAVE<1:0> 



<18:17> WO,0 



PM_MUX_ 
SEL<5:0> 



Reserved 
FLUSH SC VTM 



<24:19> WO,0 



<25> WO,0 

<26> WO,0 



Reserved 



<27> WO,0 



The bits in tliis field determine tlie number of 
cycles of wave pipelining that should be used during 
private read transactions of the Bcache. Wave 
pipelining cannot be used in 32-byte block systems. 

To enable wave pipelining, BC_CONFIG<07:04> 
should be set to the latency of the Bcache read. 
BC_CONTROL<18:17> should beset to the number 
of cycles to subtract from BC_CON F I G<07:04> to 
obtain the Bcache repetition rate. For example, if 
BC_CONFIG<07:04>=7and BC_CONTROL<18:17>=2, 
it takes seven cycles for valid data to arrive at the 
interface pins, but a new read will start every five 
cycles. 

The read repetition rate must be greater than 
3. For example, it is not permitted to set 
BC_CONFIG<07:04>=5and BC_CONTROL<18:17>=2. 

The value of BC_CONTROL<18:17> should be 
added to the normal value of BC_CON F I G<14:12> 
to increase the time between read and write 
transactions. This prevents a write transaction from 
starting before the last data of a read transaction is 
received. 

The bits in this field are used for selecting the BlU 
parameters to be driven to the two performance 
monitoring counters in the I box. Refer to Table 40 
for the field encoding. 

Reserved— M BZ. 

Flush Scache victim buffer. For systems without 
a Bcache, when this bit is clear, the 21164 flushes 
the onchip victim buffer if it has to write-back any 
entry from the victim buffer. When this bit is set, 
the 21164 writes only one entry back from the victim 
buffer as needed. This tends to cause read and write 
operations to be batched rather than interleaved. 

For systems with a Bcache, this bit must always be 
clear. At power-up, this bit is initialized to a value of 
0. 

Reserved— M BZ. 



(continued on next page) 
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Table 39 (Cont.) Bcache Control Register Fields 



Field Extent Type Description 



DIS_SYS_PAR <28> WO,0 When set, the 21164 does not check parity on the 

system command/address bus. H owever, correct 
parity will still be generated. 



Table 40 describes the PM_MUX_SEL fields. 
Table 40 PM_MUX_SEL Register Fields 



PM MUX SEL<21:19> Counter 1 



0x0 Scache accesses 

0x1 Scache read operations 

0x2 Scache write operations 

0x3 Scache victims 

0x4 Undefined 

0x5 Bcache accesses 

0x6 Bcache victims 

0x7 System command requests 



PM MUX SEL<24:22> Counter 2 



0x0 Scache misses 

0x1 Scache read misses 

0x2 Scache write misses 

0x3 Scache shared write operations 

0x4 Scache write operations 

0x5 Bcache misses 

0x6 System invalidate operations 

0x7 System read requests 
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8.3.5 Bcache Configuration (BC_CONFIG) Register (FF FFFO 01 C8) 

BC_CONFIG is a write-only register used to configure the size and speed 
of the external Bcache array. The bits in this register are initialized to the 
values indicated in Table 41 on reset, but not on timeout reset. Figure 64 and 
Table 41 describe the BC_CON FIG register format. 

Figure 64 Bcache Configuration (BC_CONFIG) Register 
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Table 41 Bcache Configuration Register Fields 



Field 



Extent 



Type 



Description 



BC SIZE<2:0> 



<02:00> W0,1 



The bits in tills field are used to Indicate 
tlie size of tlie Bcaclie. At power-on, tills 
field Is initialized to a value representing 
a IM-byte Bcaciie. Tiie field encoding Is as 
follows: 



BC 




SIZE<2:0>^ 


Size 


000 


Invalid Bcaciiesize 


001 


1 MB 


010 


2 MB 


oil 


4MB 


100 


8MB 


101 


16MB 


no 


32MB 


111 


64MB 



Reserved 



<03> 



WO,0 



Must bezero(MBZ). 



(continued on next page) 
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Table 41 (Cont.) Bcache Configuration Register Fields 



Field 



Extent 



Type 



Description 



BC RD SPD<3:0> 



<07:04> W0,4 



BC WR SPD<3:0> <11:08> W0,4 



The bits in tills field are used to Indicate 
to the Bl U the read access time of the 
Bcache, measured In CPU cycles, from the 
start of a read transaction until data is 
valid at the Input pins. The Bcache read 
speed must be within 4 to 10 CPU cycles. 
At power-up, this field Is Initialized to a 
value of 4 CPU cycles. 

The Bcache read and write speeds 
must be within three cycles of each 
other (absolute value = (BC_RD_SPD - 
BC_WR_SPD) < 4). 

For systems without a Bcache, the read 
speed must be equal to the sysclk to 
CPU clock ratio. In this configuration, 
BC_RD_SPD can be set to a value ranging 
from 3 to 15. 

The bits In this field are used to Indicate 
to the Bl U the write time of the Bcache, 
measured In CPU cycles. The Bcache write 
speed must be within 4 to 10 CPU cycles. 
At power-up, this field Is initialized to a 
value of four CPU cycles. 

For systems without a Bcache, the write 
speed must be equal to sysclk to CPU clock 
ratio. 



(continued on next page) 
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Table 41 (Cont.) Bcache Configuration Register Fields 



Field 



Extent 



Type 



Description 



BC_RD_WR 
SPC<2:0> 



<14:12> W0,7 



Reserved 

FILL_WE_ 
OFFSET<2:0> 



<15> WO,0 

<18:16> W0,1 



Reserved 



<19> 



WO,0 



The bits in tills field are used to Indicate 
to the BlU the number of CPU cycles to 
wait when switching from a private read 
to a private write Bcache transaction. For 
other data movement commands, such as 
READ DIRTY or FILL from main memory, 
It Is up to the system to direct systemwide 
data movement In a way that Is safe. A 
value of 1 must be the minimum value for 
this field. 

The BlU always Inserts three CPU 
cycles between private Bcache read and 
private Bcache write transactions. In 
addition to the number of CPU cycles 
specified by this field. The maximum value 
(BC_RD_WR_SPC+3) should not be greater 
than the Bcache READ speed when Bcache 
Is enabled. 

At power-up, this field Is initialized to a 
read/write spacing of seven CPU cycles. 

Must bezero(MBZ). 

Bcache write-enable pulse offset, from 
the sys_clk_outn_x edge, for FILL 
transactions from the system. This field 
does not affect private write transactions 
to Bcache. It Is used during Fl LLs from 
the system when writing the Bcache to 
determine the number of CPU cycles to 
wait before shifting out the contents of the 
write pulse field. 

This field Is programmed with a value In 
the range of one to seven CPU cycles. It 
must never exceed the sysclk ratio. For 
example. If the sysclk ratio Is 3, this field 
must not be larger than 3. At power-up, 
this field Is Initialized to a write offset 
value of one CPU cycle. 

Must bezero(MBZ). 



(continued on next page) 
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Table 41 (Cont.) Bcache Configuration Register Fields 



Field 



Extent 



Type 



Description 



BC WE CTL<8:0> <28:20> WO,0 



Reserved 



<63:29> WO 



Bcache write-enable control. This field is 
used to control the timing of the write- 
enable during a write or FILL transaction. 
If the bit is set, the write pulse is asserted. 
If the bit is clear, the write pulse is not 
asserted. Each bit corresponds to a CPU 
cycle. The least-significant bit corresponds 
to the CPU cycle in which the 21164 starts 
to drive the index for the write operation. 

For private Bcache write and shared- 
write transactions, this field is used to 
assert the write pulse without any write- 
enable pulse offset as indicated by the 
F1LL_WE_OFFSET<2:0> field. 

For FILLS to the Bcache, the 
FILL_WE_OFFSET<2:0> field determines 
the number of CPU cycles to wait before 
asserting the write pulse as programmed 
in this field. 

At power-up, all bits in this field are 
cleared. 

Ignored. 
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8.3.6 Bcache Tag Address (BC_TAG_ADDR) Register (FF FFFO 0108) 

BC_TAG_ADDR is a read-only register. Unless locked, the BC_TAG_ADDR 
register is loaded with the results of every Bcache tag read. When a tag or 
tag control parity error occurs, this register is locked against further updates. 
Software may read this register by using the 21164-specific I/O space address 
instruction. This register is unlocked whenever the EI_STAT register is read, 
or the user enters BC_FHIT mode. It is not unlocked by reset. 

Note 



The correct address is not loaded into BC_TAG_ADDR if a tag parity 
error is detected when servicing a system command from the Bcache. 



Unused tag bits in the TAG field of this register are always zero, based on the 
size of the Bcache as determined by theBC_SIZE field of the BC_CONTROL 
register. Figure 65 and Table 42 describe the BC_TAG_ADDR register format. 

Figure 65 Bcache Tag Address (BC_TAG_ADDR) Register 
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Table 42 Bcache Tag Address Register Fields 



Field 



Extent 



Type 



Description 



HIT 


<12> 


RO 


TAGCTL_P 


<13> 


RO 


TAGCTL_D 


<14> 


RO 


TAGCTL_S 


<15> 


RO 


TAGCTL_V 


<15> 


RO 


TAG_P 


<17> 


RO 


BC TAG<38:20> 


<38:20> 


RO 



If set, Bcache access resulted in a hit in 
the Bcache. 

Value of the parity bit for the Bcache tag 
status bits. 

Value of the Bcache TAG dirty bit. 

Value of the Bcache TAG shared bit. 

Valueof the Bcache TAG valid bit. 

Value of the tag parity bit. 

Bcache tag bits as read from the Bcache. 
Unused bits are read as zero. 
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8.3.7 External Interface Status (EI_STAT) Register (FF FFFO 0168) 

EI_STAT is a read-only register. Any PALcode read access of this register 
unlocks and clears it. A read access of EI_STAT also unlocks the EI_ADDR, 
BC_TAG, and FILL_SYN registers subject to some restrictions. The EI_STAT 
register is not unlocked or cleared by reset. 

Fill data from Bcache or main memory could have correctable (c) or 
uncorrectable (u) errors in ECC mode. In parity mode, fill data parity errors 
are treated as uncorrectable hard errors. System address/command parity 
errors are always treated as uncorrectable hard errors irrespective of the mode. 
The sequence for reading, unlocking, and clearing EI_ADDR, BC_TAG, FILL_ 
SYN, and EI_STAT is as follows: 

1. Read EI_ADDR, BC_TAG, and FILL_SYN in any order. Does not unlock or 
clear any register. 

2. Read EI_STAT register. Reading this register unlocks EI_ADDR, BC_TAG, 
and FILL_SYN registers. EI_STAT is also unlocked and cleared when read, 
subject to conditions described in Table 43. 

Loading and locking rules for external interface registers are defined in 
Table 43. 

Note 



If the first error is correctable, the registers are loaded but not locked. 
On the second correctable error, registers are neither loaded nor locked. 

Registers are locked on the first uncorrectable error except the 
second hard error bit. The second hard error bit is set only for an 
uncorrectable error followed by an uncorrectable error. If a correctable 
error follows an uncorrectable error, it is not logged as a second error. 
Bcache tag parity errors are uncorrectable in this context. 
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Table 43 Loading and Locking Rules for External Interface Registers 

Correctable Uncorrectable Second Hard Load Lock 

Error Error Error Register Register 



Action when El STAT is read 









Not 


possible 


No 


No 


1 





Not 


possible 


Yes 


No 





1 







Yes 


Yes 


ll 


1 







Yes 


Yes 





1 


1 




No 


Already 
locl<ed 


ll 


1 


1 




No 


Already 
locl<ed 



Clears and unlocl<s everything. 

Clears and unlocl<s everything. 

Clears and unlocl<s everything. 

Clear (c) bit does not unlock. 
Transition to (0,1,0) state. 

Clears and unlocl<s everything. 

Clear (c) bit does not unlock. 
Transition to (0,1,1) state. 



^These are special cases. It is possible that when EI_ADDR is read, only the correctable error bit is set and 
the registers are not locked. By the time EI_STAT is read, an uncorrectable error is detected and the registers 
are loaded again and locked. The value of EI_ADDR read earlier is no longer valid. Therefore, for the (l,l,x) 
case, when EI_STAT is read correctable, the error bit is cleared and the registers are not unlocked or cleared. 
Software must reexecute the I PR read sequence. On the second read operation, error bits are in (0,l,x) state, 
all the related IPRs are unlocked, and EI_STAT is cleared. 

The EI_STAT register is a read-only register used to control external interface 
registers. Figure 66 and Table 44 describe the EI_STAT register format. 

Figure 66 External Interface Status (EI_STAT) Register 
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Table 44 EI_STAT Register Fields 



Field 



Extent Type Description 



CHIPJD<3:0> 
BC_TPERR 
BC_TC_PERR 
El ES 



<27:24> RO 
<28> RO 
<29> RO 
<30> RO 



COR ECC ERR <31> RO 



UNC ECC ERR <32> RO 



El PAR ERR 



FIL IRD 



<33> RO 



<34> RO 



SEO HRD ERR <35> RO 



Read as "4." Future update revisions to tlie cliip will 
return new unique values. 

I ndicates that a Bcache read transaction encountered 
bad parity in the tag address RAM . 

I ndicates that a Bcache read transaction encountered 
bad parity in the tag control RAM . 

When set, this bit indicates that the error source is fill 
data from main memory or a system address/command 
parity error. 

When clear, the error source is fill data from the Bcache. 
This bit is only meaningful when COR_ECC_ERR, 
UNC_ECC_ERR, or EI_PAR_ERR is set. 

This bit is not defined for a Bcache tag error 
(BC_TPERR) or a Bcache tag control parity error 
(BC_TC_ERR). 

Correctable ECC error. This bit indicates that a fill data 
received from outside the CPU contained a correctable 
ECC error. 

Uncorrectable ECC error. This bit indicates that 
fill data received from outside the CPU contained 
an uncorrectable ECC error. In the parity mode, it 
indicates data parity error. 

External interface command/address parity error. This 
bit indicates that an address and command received by 
the CPU has a parity error. 

This bit has meaning only when one of the ECC or 
parity error bit is set. It is set to indicate that the error 
occurred during an l-ref FILL and clear to indicate that 
the error occurred during a D-ref FILL. 

This bit is not defined for a Bcache tag error 
(BC_TPERR) or a Bcache tag control parity error 
(BC_TC_ERR). 

Second external interface hard error. This bit indicates 
that a Fl LL from Bcache or main memory, or a system 
address/command received by the CPU has a hard error 
whileoneof the hard error bits in theEI_STAT register 
is already set. 
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8.3.8 External Interface Address (EI_ADDR) Register (FF FFFO 0148) 

EI_ADDR is a read-only register that contains the physical address associated 
with errors reported by the EI_STAT register. Its content is meaningful only 
when one of the error bits is set. A read of EI_STAT unlocks the EI_ADDR 
register. Figure 67 shows the EI_ADDR register format. 

Figure 67 External Interface Address (EIADDR) Register 
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8.3.9 Fill Syndrome (FILL_SYN) Register (FF FFFO 0068) 

FILL_SYN is a 16-bit read-only register. It is loaded but not locked on a 
correctable ECC error, so that another correctable error does not reload it. It is 
loaded and locked if an uncorrectable ECC error or parity error is recognized 
during a FILL from Bcache or main memory, as shown in Table 43. The FILL_ 
SYN register is unlocked when the EI_STAT register is read. This register is 
not unlocked by reset. 

If the 21164 is in ECC mode and an ECC error is recognized during a cache fill 
transaction, the syndrome bits associated with the bad quadword are loaded in 
theFILL_SYN register. F I LL_SYN<07:00> contains the syndrome associated 
with the lower quadword of the octaword. FILL_SYN<15:08> contains the 
syndrome associated with the higher quadword of the octaword. A syndrome 
value of means that no errors where found in the associated quadword. 

If the 21164 is in parity mode and a parity error is recognized during a cache 
fill transaction, the FILL_SYN register indicates which of the bytes in the 
octaword has bad parity. FILL_SYNDROME<07:00> is set appropriately to 
indicate the bytes within the lower quadword that were corrupted. Likewise, 
FILL_SYN<15:08> is set to indicate the corrupted bytes within the upper 
quadword. Figure 68 shows the F I LL_SYN register format. 
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Figure 68 Fill Syndrome (FILL_SYN) Register 
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Table 45 lists the syndromes associated with correctable single-bit errors. 



Table 45 Syndromes for Single-Bit Errors 



Data Bit 



Syndrome 



16 



Check Bit 


Syndrome 


00 


01 


01 


02 


02 


04 


03 


08 


04 


10 


05 


20 


06 


40 


07 


80 



00 


CE 


01 


CB 


02 


D3 


03 


D5 


04 


D6 


05 


D9 


06 


DA 


07 


DC 


08 


23 


09 


25 


10 


26 


11 


29 


12 


2A 


13 


2C 


14 


31 


15 


34 


16 


OE 


17 


OB 



(continued on next page) 
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Table 45 (Cont.) Syndromes for Single-Bit Errors 



Data Bit Syndromeie Checl< Bit Syndromei 



18 


13 


19 


15 


20 


16 


21 


19 


22 


lA 


23 


IC 


24 


E3 


25 


E5 


26 


E6 


27 


E9 


28 


EA 


29 


EC 


30 


Fl 


31 


F4 


32 


4F 


33 


4A 


34 


52 


35 


54 


36 


57 


37 


58 


38 


5B 


39 


5D 


40 


A2 


41 


A4 


42 


A7 


43 


A8 


44 


AB 


45 


AD 


46 


BO 



(continued on next page) 
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Table 45 (Cont.) Syndromes for Single-Bit Errors 



Data Bit Syndromeie Check Bit Syndromei 



47 


B5 


48 


8F 


49 


8A 


50 


92 


51 


94 


52 


97 


53 


98 


54 


9B 


55 


9D 


56 


62 


57 


64 


58 


67 


59 


68 


60 


6B 


61 


6D 


62 


70 


63 


75 
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8.4 PALcode Storage Registers 

The 21164 Ebox register file has eight extra registers that are called the 
PALshadow registers. The PALshadow registers overlay R8 through R14 and 
R25 when the CPU is in PALmodeand ICSR<SDE>is set. Thus, PALcode can 
consider R8 through R14 and R25 as local scratch. PALshadow registers can 
not be written in the last two cycles of a PALcode flow. The normal state of the 
CPU is ICSR<SDE> = ON. PALcode disables SDE for the unaligned trap and 
for error flows. 

The I box holds a bank of 24 PALtemp registers. The PALtemp registers are 
accessed with the HW_MTPR and HW_MFPR instructions. The latency from a 
PALtemp read operation to availability is one cycle. 
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8.5 Restrictions 



8.5.1 



The following sections list all known register access restrictions. A software 
tool called the PAL code violation checker (PVC) is available. This tool can be 
used to verify adherence to many of the PALcode restrictions. 

Cbox IPR PALcode Restrictions 

Table 46 describes the Cbox IPR PALcode restrictions. 



Table 46 Cbox IPR PALcode Restrictions 



Condition 



Restriction 



Store to SC_CTL, BC_CONTROL, BC_ 
CONFIG except if no bit is changed other 
than BC_CONTROL<ALLOC_CYC>, 
BC_CONTROL<PM_MUX_SEL>, or BC_ 
CONTROL <DBG_MUX_SEL>. 

Store to BC_CONTROL that only 
changes bits BC_CONTROL<^LLOC_ 
CYC>, BC_CONTROL<PM_MUX_SEL>, 
or BC_CONTROL<OBG_MUX_SEL>. 

Load from SC_STAT. 

Load from EI_STAT. 

Any Cbox IPR address. 
Any undefined Cbox IPR address. 
Scache or Bcache in force hit mode. 
Clearing of SC_F HIT in SC_CTL. 

Clearing of BC_F HIT in BC_CONTROL. 

Load from any Cbox I PR. 



M ust be preceded by M B, must be followed 
by M B, must have no concurrent cacheable 
I stream references or concurrent system 
commands. 

M ust be preceded by M B and must be 
followed by MB. 



Unlocks SC_ADDR and SC_STAT. 

Unlocks EI_ADDR, EI_STAT, FILL_SYN, 
and BC_TAG_ADDR. 

NoLDx_L or STx_C. 

No store instructions. 

No STx_C to cacheable space. 

Must be followed by MB, read operation 
of SC_STAT, then MB prior to subsequent 
store. 

Must be followed by MB, read operation 
of EI_STAT, then M B prior to subsequent 
store. 

BC_CONTROL<01> (ALLOC_CYCLE) 
must be clear. 
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8.5.2 PALcode Restrictions — Instruction Definitions 

Mbox instructions are: LDx, LDQ_U, LDx_L, HW_LD, STx, STQ_U, STx_C, 
HW_5T, and FETCHx. 

Virtual IMbox instructions are: LDx, LDQ_U, LDx_L, HW_LD (virtual), STx, 
STQ_U, STx_C, HW_ST (virtual), and FETCHx. 

Load instructions are: LDx, LDQ_U, LDx_L, and HW_LD. 

Store instructions are: STx, STQ_U, STx_C, and HW_ST. 

Table 47 lists PALcode restrictions. 



Table 47 PALcode Restrictions Table 



The following in cycle 0: 



Restrictions (Note: Numbers refer to cycle number): 



Y if checked 
by PVC^ 



CALL_PAL entry 

PALshadow write instruction 
HW_LD, locl< bit set 

HW_LD, VPTE bit set 
Any Iced instruction 



Any store instruction 



Any virtual IM box instruction 

Any IMbox instruction or WMB, 
if it traps 



Any I box trap except PC- 
mispredict, ITBIVJISS, or 
OPCDEC due to user mode 

HW RE! STALL 



NoHW_REI or HW_REI_STALL in cycle 0. Y 

NoHW_MFPR EXC_ADDR in cycleO,!. Y 

No HW_REI or HW_REI_STALL in 0, 1. Y 

PAL must slot to EO. 

No other Mbox instruction in 0. 

No other virtual reference in 0. 

NoMboxHW_MTPR or HW_MFPR in 0. Y 

No HW_MFPR MAF_MODE in 1,2 (DREAD_PENDING Y 

may not be updated). 

NoHW_MFPR DC_PERR_STAT in 1,2. Y 

No HW_MFPR DC_TEST_TAG slotted in 0. 

NoHW_MFPR DC_PERR_STAT in 1,2. Y 

No HW_MFPR MAF_MODE in 1,2 (WB_PENDING may Y 
not be updated). 

NoHW_MTPR DTBJS in 1. Y 

HW_MTPR any I box I PR not aborted in 0,1 (except that Y 
EXC_ADDR is updated with correct faulting PC). 
HW_MTPR DTBJS not aborted in 0,1. Y 

HW MTPR DTB IS not aborted in 0,1. 



Only one HW_REI_STALL in an aligned block of four 
instructions. 



■"^PALcode violation checker 



(continua:! on next page) 
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Table 47 (Cont.) PALcode Restrictions Table 



The following in cycle 0: 



Restrictions (Note: Numbers refer to cycle number): 



Y if checked 
by PVC^ 



HW_MTPR any undefined I PR 
number 

ARITH trap entry 

M achi ne check trap entry 

HW_MTPR any I box I PR 
(including PALtemp registers) 

HW_|V|TPR ASTRR, ASTER 

HW_|V|TPR SIRR 
HW_|V|TPR EXC_ADDR 
HW_|V|TPR IC_FLUSH_CTL 
HW_|V|TPR ICSR: HWE 
HW_|V|TPR ICSR: FPE 

HW_|V|TPR ICSR: SPE, FMS 

HW_MTPR ICSR: SPE 
HW_MTPR ICSR: SDE 

HW_|V|TPR ITB_ASN 

HW_MTPR ITB_PTE 

HW_MTPR ITBJAP, ITBJS, 
ITBJA 

HW_MTPR ITBJS 

HW_MTPR IVPTBR 

HW_MTPR PAL_BASE 

HW MTPR ICM 



Illegal in any cycle. 

NoHW_MFPR EXC_SUM or EXC_MASK in cycleO,!. Y 

No register file read or write access in 0,1,2,3,4,5,6,7. 
NoHW_MFPR EXC_SUM or EXC_MASK in cycleO,!. Y 

No HW_MFPR same IPR in cycle 1,2. Y 

No floating-point conditional branch in 0. 
No FEN or OPCDEC instruction in 0. 

No HW_MFPR INTID in 0,1,2,3,4,5. Y 

NoHW_REI in 0,1. Y 

No HW_MFPR INTID in 0,1,2,3,4. Y 

NoHW_REI in cycle 0,1. Y 

Must be followed by 44 inline PALcode instructions. 

NoHW_REI in 0,1,2,3. Y 

No floating-point instructions in 0, 1, 2, 3. 
NoHW_REI in 0,1,2. 

If HW_REI_STALL, then no HW_REI_STALL in 0,1. Y 

If HW_REI, then noHW_REI in 0,1,2,3,4. Y 

Must flush I cache. 

No PALshadow read/write access in 0,1,2,3. 

NoHW_REI in 0,1,2. Y 

Must be followed by HW_REI_STALL. 

NoHW_REI_STALL in cycle 0,1,2,3,4. Y 

No HW_MTPR ITBJS in 0,1,2,3. Y 

Must be followed by HW_REI_STALL. 

Must be followed by HW_REI_STALL. 

HW_REI_STALL must be in the same I stream octaword. 

NoHW_MFPR IFAULT_VA_FORM in 0,1,2. Y 

No CALL_PAL in 0,1,2,3,4,5,6,7. Y 

No HW_REI in 0,1,2,3,4,5,6. Y 

NoHW_REI in 0,1,2. Y 

No private CALL_PAL in 0,1,2,3. 



■"^PALcode violation checker 



(continued on next page) 



156 Preliminary — Subject to Ctiange — July 1996 



Table 47 (Cont.) PALcode Restrictions Table 



The following in cycle 0: 



Restrictions (Note: Numbers refer to cycle number): 



Y if checked 
by PVC^ 



HW_MTPR CC, CC_CTL 
HW_MTPR DC_FLUSH 

HW_MTPR DC_MODE 

HW_MTPR DC_PERR_STAT 
HW_MTPR DC_TEST_CTL 
HW_MTPR DC_TEST_TAG 
HW MTPR DTB ASN 



HW_MTPR DTB_CM, ALT_ 
MODE 

HW MTPR DTB PTE 



HW_MTPR DTB_TAG 

HW_MTPR DTBJAP, DTBJA 

HW_MTPR DTBJA 
HW MTPR MAP MODE 



NoRPCC in 0,1,2. Y 

NoHW_REI in 0,1. Y 

No M box instructions in 1,2. Y 

No outstanding fills in 0. 

NoHW_REI in 0,1. Y 

No M box instructions in 1,2,3,4. Y 

NoHW_MFPR DC_MODE in 1,2. Y 
No outstanding fills in 0. 

NoHW_REI in 0,1,2,3. Y 

No HW_REI_STALL in 0,1. Y 

No load or store instructions in 1. Y 

NoHW_MFPR DC_PERR_STAT in 1,2. Y 

NoHW_MFPR DC_TEST_TAG in 1,2,3. Y 
No HW_MFPR DC_TEST_CTL issued or slotted in 1,2. 

No outstanding DC fills in 0. 

NoHW_MFPR DC_TEST_TAG in 1,2,3. Y 

No virtual Mbox instructions in 1,2,3. Y 

NoHW_REI in 0,1,2. Y 

No virtual Mbox instructions in 1,2. Y 

NoHW_REI in 0,1. Y 

No virtual Mbox instructions in 2. Y 
No HW_MTPR DTB_ASN, DTB_CM, ALT_MODE, MCSR, Y 
MAF_MODE, DC_MODE, DC_PERR_STAT, 
DC_TEST_CTL, DC_TEST_TAG in 2. 

No virtual Mbox instructions in 1,2,3. Y 

No HW_MTPR DTB_TAG in 1. Y 

NoHW_MFPR DTB_PTE in 1,2. Y 

NoHW_MTPR DTBJS in 1,2. Y 

NoHW_REI in 0,1,2. Y 

No virtual Mbox instructions in 1,2,3. Y 

No HW_MTPR DTBJS in 0,1,2. Y 

NoHW_REI in 0,1,2. Y 

NoHW_MFPR DTB_PTE in 1. Y 

No Mbox instructions in 1,2,3. Y 

NoWMB in 1,2,3. Y 

NoHW_MFPR MAF_MODE in 1,2. Y 

NoHW RE! in 0,1,2. Y 



■"^PALcode violation checker 
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Table 47 (Cont.) PALcode Restrictions Table 



The following in cycle 0: 



Restrictions (Note: Numbers refer to cycle number): 



Y if checked 
by PVC^ 



HW MTPR MCSR 



HW_MTPR MVPTBR 
HW_MFPR ITB_PTE 
HW MFPR DC TEST TAG 



HW MFPR DTB PTE 



HW MFPR VA 



Novirtual Mbox instructions in 0,1,2,3,4. Y 

NoHW_MFPR MCSR in 1,2. Y 

NoHW_MFPR VA_FORM in 1,2,3. Y 

NoHW_REI in 0,1,2,3. Y 

NoHW_REI_STALL in 0,1. Y 

NoHW_MFPR VA_FORM in 1,2. Y 

NoHW_MFPR ITB_PTE_TEMP in 1,2,3. Y 

No outstanding DC fills in 0. 

No HW_MFPR DC_TEST_TAG_TEMP issued or slotted 

in 1. 

No LDx instructions slotted in 0. 

NoHW_MTPR DC_TEST_CTL between HW_MFPR 

DC_TEST_TAG and HW_MFPR DC_TEST_TAG_TEMP. 

No Mbox instructions in 0,1. Y 

No HW_MTPR DC_TEST_CTL, DC_TEST_TAG in 0,1. Y 

No HW_MFPR DTB_PTE_TEMP issued or slotted in 

1,2,3. 

NoHW_MFPR DTB_PTE in 1. Y 

Novirtual Mbox instructions in 0,1,2. Y 

Must bedonein ARITH, MACHINE CHECK, 
DTBMISS_SINGLE, UNALIGN, DFAULT traps and 
ITBMISS flow after the VPTE load. 



■■^PALcode violation checker 
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9 PALcode 

Privileged architecture library code (PALcode) is macrocode that provides an 
architecturally defined operating-system-specific programming interface that 
is common across all Alpha microprocessors. The actual implementation of 
PALcode differs for each operating system. 

PALcode runs with privileges enabled, instruction stream (I stream) mapping 
disabled, and interrupts disabled. PALcode has privilege to use five special 
opcodes that allow functions such as physical data stream (Dstream) references 
and internal processor register (I PR) manipulation. 

PALcode can be invoked by the following events: 

Reset 

System hardware exceptions (MCHK, ARITH) 

Memory-management exceptions 

Interrupts 

CALL_PAL instructions 

9.1 PALcode Entry Points 

PALcode is invoked at specific entry points. The 21164 has two types of 
PALcode entry points: 

• CALL_PAL entry points are used whenever the I box encounters a 
CALL_PAL instruction in the I stream. 

- Privileged CALL_PAL instructions start at offset 2000. 

Unprivileged CALL_PAL instructions start at offset 3000. 

• Chip-specific trap entry points start PALcode. 
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9.1.1 PALcode Trap Entry Points 

Table 48 shows the PALcode trap entry points and their offset from the 
PAL_BASE I PR. Entry points are listed from highest to lowest priority. 

Table 48 PALcode Trap Entry Points 



Entry Name 



Offseti 



RESET 
lACCVIO 



0000 
0080 



INTERRUPT 0100 

ITBMISS 0180 

DTBIV1ISS_SINGLE 0200 

DTBMISS DOUBLE 0280 



UNALIGN 


0300 


D FAULT 


0380 


MCHK 


0400 


OPCDEC 


0480 


ARITH 


0500 


FEN 


0580 



Description 



Reset 

I stream access violation or sign check error 
on PC 

I nterrupt: hardware, software, and AST 

I stream TB MISS 

DstreamTBMISS 

Dstream TBMISS during virtual page table 
entry (PTE) fetch 

Dstream unaligned reference 

Dstream fault or sign check error on virtual 
address 

U ncorrected hardware error 

Illegal opcode 

Arithmetic exception 

Floating-point operation attempted with: 

• Floating-point instructions (LD, ST, and 
operates) disabled through FPE bit in the 
ICSR IPR 

• Floating-point IEEE operation with data 
type other than S, T, or Q 
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9.2 Required PALcode Function Codes 

Table 49 lists opcodes required for all Alpha implementations. The notation 
used is oo.ffff, where oo is the hexadecimal 6-bit opcode and ffff is the 
hexadecimal 26-bit function code. 

Table 49 Required PALcode Function Codes 
Mnemonic Type Function Code 

DRAINA Privileged 00.0002 

HALT Privileged 00.0000 

1MB Unprivileged 00.0086 

9.3 Opcodes Reserved for PALcode 

Table 50 lists the opcodes reserved by the Alpha architecture for 
implementation-specific use. These opcodes are privileged and are only 
available in PALmode. Section 10.2 shows the opcodes reserved for PALcode. 



Table 50 Opcodes Reserved for PALcode 



Opcode Archiitecture Mnemonic 



IB 


PAL IB 


IF 


PAL IF 


IE 


PAL IE 


19 


PAL 19 


ID 


PAL ID 
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10 Alpha Instruction Summary 

This section contains a summary of all Alpha architecture instructions. All 
values are in hexadecimal radix. Table 51 describes the contents of the Format 
and Opcode columns that are in Table 52. 

Table 51 Instruction Format and Opcode Notation 



Instruction 
Format 


Format 
Symbol 


Opcode 
Notation 


Meaning 


Branch 


Bra 


00 


00 is the 6-bit opcode field. 


Floating- 
point 


F-P 


00. m 


00 is the 6-bit opcode field. 

fff\s the 11-bit function code field. 


M emory 


Mem 


00 


00 is the 6-bit opcode field. 


M emory/ 
function code 


Mfc 


oo.ffff 


00 is the 6-bit opcode field. 

ffff\s the 16-bit function code in the 

displacement field. 


M emory/ 
branch 


Mbr 


oo.h 


00 is the 6-bit opcode field. 

h is the high-order 2 bits of the 

displacement field. 


Operate 


Opr 


00. ff 


00 is the 6-bit opcode field. 

ff is the 7-bit function code field. 


PAL code 


Pcd 


00 


00 is the 6-bit opcode field; the 
particular PALcode instruction is 
specified in the 26-bit function code 
field. 
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Qualifiers for operate instructions are shown in Table 52. Qualifiers for 
IEEE and VAX floating-point instructions are shown in Tables 55 and 56, 
respectively. 

Table 52 Architecture Instructions 



Mnemonic 



Format Opcode 



Description 



ADDF 


F-P 


15.080 


ADDG 


F-P 


15.0A0 


ADDL 


Opr 


10.00 


ADDL/V 


Opr 


10.40 


ADDQ 


Opr 


10.20 


ADDQ/V 


Opr 


10.60 


ADDS 


F-P 


16.080 


ADDT 


F-P 


16.0A0 


AND 


Opr 


11.00 


BEQ 


Bra 


39 


BGE 


Bra 


3E 


BGT 


Bra 


3F 


BIC 


Opr 


11.0 


BIS 


Opr 


11.20 


BLBC 


Bra 


38 


BLBS 


Bra 


3C 


BLE 


Bra 


3B 


BLT 


Bra 


3A 


BNE 


Bra 


3D 


BR 


Bra 


30 


BSR 


Mbr 


34 


CALL PAL 


Pcd 


00 


CMOVEQ 


Opr 


11.24 


CMOVGE 


Opr 


11.46 


CMOVGT 


Opr 


11.66 


CMOVLBC 


Opr 


11.16 


CMOVLBS 


Opr 


11.14 


CMOVLE 


Opr 


11.64 


CMOVLT 


Opr 


11.44 


CMOVNE 


Opr 


11.26 


CMPBGE 


Opr 


lO.OF 


CMPEQ 


Opr 


10.2D 


CMPGEQ 


F-P 


15.0A5 


CMPGLE 


F-P 


15.0A7 



Add F_floating 

Add G_floating 

Add longword 

Add longword 

Add quadword 

Add quadword 

Add S_floating 

Add T_floating 

Logical product 

Branch if = zero 

Branch if > zero 

Branch if > zero 

Bit clear 

Logical sum 

Branch if low bit clear 

Branch if low bit set 

Branch if < zero 

Branch if < zero 

Branch if 7^ zero 

Unconditional branch 

Branch to subroutine 

Trap to PAL code 

CMOVE if = zero 

CMOVE if > zero 

CMOVE if > zero 

CMOVE if low bit clear 

CMOVE if low bit set 

CMOVE if < zero 

CMOVE if < zero 

CMOVE if 7^ zero 

Compare byte 

Compare signed quadword equal 

Compare G_floating equal 

Compare G_floating less than or 

equal 

(continued on next page) 
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Table 52 (Cont.) Architecture Instructions 



Mnemonic 



Format 



Opcode 



Description 



CMPGLT 


F-P 


15.0A6 


CMPLE 


Opr 


10.6D 


CMPLT 


Opr 


10.4D 


CMPTEQ 


F-P 


16.0A5 


CMPTLE 


F-P 


16.0A7 


CMPTLT 


F-P 


16.0A6 


CMPTUN 


F-P 


16.0A4 


CMPULE 


Opr 


10.3D 


CMPULT 


Opr 


lO.lD 


CPY5 


F-P 


17.020 


CPYSE 


F-P 


17.022 


CPYSN 


F-P 


17.021 


CVTDG 


F-P 


15.09E 


CVTGD 


F-P 


15.0AD 


CVTGF 


F-P 


15.0AC 


CVTGQ 


F-P 


15.0AF 


CVTLQ 


F-P 


17.010 


CVTQF 


F-P 


15.0BC 


CVTQG 


F-P 


15.0BE 


CVTQL 


F-P 


17.030 


CVTQL/SV 


F-P 


17.530 


CVTQL/V 


F-P 


17.130 


CVTQS 


F-P 


16.0BC 


CVTQT 


F-P 


16.0BE 


CVTST 


F-P 


16.2AC 


CVTTQ 


F-P 


16.0AF 


CVI IS 


F-P 


16.0AC 


DIVF 


F-P 


15.083 


DIVG 


F-P 


15.0A3 


DIVS 


F-P 


16.083 


DIVT 


F-P 


16.0A3 


EQV 


Opr 


11.48 


EXCB 


Mfc 


18.0400 


EXTBL 


Opr 


12.06 


EXTLH 


Opr 


12.6A 



Compare G_floating less than 

Compare signed quadword less 

than or equal 

Compare signed quadword less 

than 

Compare T_floating equal 

Compare T_floating less than or 

equal 

Compare T_floating less than 

Compare T_floating unordered 

Compare unsigned quadword 

less than or equal 

Compare unsigned quadword 

less than 

Copy sign 

Copy sign and exponent 

Copy sign negate 

Convert D_floating to G_floating 

Convert G_floating to D_floating 

Convert G_floating to F_floating 

Convert G_floating to quadword 

Convert longword to quadword 

Convert quadword to F_floating 

Convert quadword to G_floating 

Convert quadword to longword 

Convert quadword to longword 

Convert quadword to longword 

Convert quadword to S_floating 

Convert quadword toT_floating 

Convert S_floating toT_floating 

Convert T_floating to quadword 

Convert T_floating to S_floating 

Divide F_floating 

Divide G_floating 

Divide S_floating 

Divide T_f]oating 

Logical equivalence 

Exception barrier 

Extract byte low 

Extract longword high 

(continued on next page) 
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Table 52 (Cont.) Architecture Instructions 



Mnemonic 



Format 



Opcode 



Description 



EXTLL 


Opr 


12.26 


EXTQH 


Opr 


12.7A 


EXTQL 


Opr 


12.36 


EXTWH 


Opr 


12.5A 


EXTWL 


Opr 


12.16 


FBEQ 


Bra 


31 


FBGE 


Bra 


36 


FBGT 


Bra 


37 


FBLE 


Bra 


33 


FBLT 


Bra 


32 


FBNE 


Bra 


35 


FCMOVEQ 


F-P 


17.02A 


FCMOVGE 


F-P 


17.02D 


FCMOVGT 


F-P 


17.02F 


FCMOVLE 


F-P 


17.02E 


FCMOVLT 


F-P 


17.02C 


FCMOVNE 


F-P 


17.02B 


FETCH 


Mfc 


18.80 


FETCH M 


Mfc 


18.A0 


INSBL 


Opr 


12.0B 


INSLH 


Opr 


12.67 


INSLL 


Opr 


12.2B 


INSQH 


Opr 


12.77 


INSQL 


Opr 


12.3B 


INSWH 


Opr 


12.57 


INSWL 


Opr 


12.1B 


J MP 


Mbr 


lA.O 


JSR 


Mbr 


lA.l 


JSR COROUTINE 


Mbr 


1A.3 


LDA 


Mem 


08 


LDAH 


Mem 


09 


LDF 


Mem 


20 


LDG 


Mem 


21 


LDL 


Mem 


28 


LDL_L 


Mem 


2A 


LDQ 


Mem 


29 


LDQ L 


Mem 


2B 


LDQ U 


Mem 


OB 



Extract longword low 

Extract quadword high 

Extract quadword low 

Extract word high 

Extract word low 

Floating branch if =zero 

Floating branch if > zero 

Floating branch if >zero 

Floating branch if < zero 

Floating branch if <zero 

Floating branch \f ^ zero 

FCMOVE if = zero 

FCMOVE if > zero 

FCMOVE if > zero 

FCMOVE if < zero 

FCMOVE if < zero 

FCMOVE if 7^ zero 

Prefetch data 

Prefetch data, modify intent 

Insert byte low 

Insert longword high 

Insert longword low 

Insert quadword high 

Insert quadword low 

Insert word high 

Insert word low 

J ump 

J ump to subroutine 

J ump to subroutine return 

Load address 

Load address high 

Load F_floating 

Load G_f]oating 

Load sign-extended longword 

Load sign-extended longword 

locked 

Load quadword 

Load quadword locked 

Load unaligned quadword 

(continued on next page) 
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Table 52 (Cont.) Architecture Instructions 



Mnemonic 


Format 


Opcode 


Description 


LDS 


Mem 


22 


Load S_floating 


LDT 


Mem 


23 


Load T_floating 


MB 


Mfc 


18.4000 


Memory barrier 


MF_FPCR 


F-P 


17.025 


M ove from floating-point control 
register 


MSKBL 


Opr 


12.02 


Masl< byte low 


MSKLH 


Opr 


12.62 


Mask longword high 


MSKLL 


Opr 


12.22 


Mask longword low 


MSKQH 


Opr 


12.72 


Mask quadword high 


MSKQL 


Opr 


12.32 


Mask quadword low 


MSKWH 


Opr 


12.52 


Mask word high 


MSKWL 


Opr 


12.12 


Mask word low 


|V1T_FPCR 


F-P 


17.024 


M ove to floating-point control 
register 


MULF 


F-P 


15.082 


Multiply F_floating 


MULG 


F-P 


15.0A2 


Multiply G_floating 


MULL 


Opr 


13.00 


Multiply longword 


MULL/V 


Opr 


13.40 


Multiply longword 


MULQ 


Opr 


13.20 


Multiply quadword 


MULQ/V 


Opr 


13.60 


Multiply quadword 


MULS 


F-P 


16.082 


Multiply S_floating 


MULT 


F-P 


16.0A2 


Multiply T_floating 


ORNOT 


Opr 


11.28 


Logical sum with complement 


RC 


Mfc 


18.E0 


Read and clear 


RET 


Mbr 


1A.2 


Return from subroutine 


RPCC 


Mfc 


18.C0 


Read process cycle counter 


RS 


Mfc 


18.F000 


Read and set 


S4ADDL 


Opr 


10.02 


Scaled add longword by 4 


S4ADDQ 


Opr 


10.22 


Scaled add quadword by 4 


S4SUBL 


Opr 


lO.OB 


Scaled subtract longword by 4 


S4SUBQ 


Opr 


10.2B 


Scaled subtract quadword by 4 


S8ADDL 


Opr 


10.12 


Scaled add longword by 8 


S8ADDQ 


Opr 


10.32 


Scaled add quadword by 8 


S8SUBL 


Opr 


lO.lB 


Scaled subtract longword by 8 


S8SUBQ 


Opr 


10.3B 


Scaled subtract quadword by 8 


SLL 


Opr 


12.39 


Shift left logical 


SRA 


Opr 


12.3C 


Shift right arithmetic 


SRL 


Opr 


12.34 


Shift right logical 


STF 


Mem 


24 


Store F_floating 

(continued on next page) 
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Table 52 (Cont.) Architecture Instructions 



Mnemonic 



Format 



Opcode 



Description 



STG 
STS 
STL 

5TL_C 

STQ 

5TQ_C 

STQ_U 

STT 

SUBF 

5UBG 

SUBL 

SUBL/V 

SUBQ 

SUBQ/V 

SUBS 

SUBT 

TRAPB 

UMULH 

WMB 
XOR 
ZAP 
ZAP NOT 



Mem 


25 


Mem 


25 


Mem 


2C 


Mem 


2E 


Mem 


2D 


Mem 


2F 


Mem 


OF 


Mem 


27 


F-P 


15.081 


F-P 


15.0A1 


Opr 


10.09 




10.49 


Opr 


10.29 




10.69 


F-P 


16.081 


F-P 


16.0A1 


Mfc 


18.00 


Opr 


13.30 


Mfc 


18.44 


Opr 


11.40 


Opr 


12.30 


Opr 


12.31 



Store G_fl oat! ng 

Store S_fl oat! ng 

Store longword 

Store longword conditional 

Store quadword 

Store quadword conditional 

Store unaligned quadword 

Store T_floating 

Subtract F_floating 

Subtract G_floating 

Subtract longword 

Subtract quadword 

Subtract S_floating 

Subtract T_floating 

Trap barrier 

Unsigned multiply quadword 

high 

Write memory barrier 

Logical difference 

Zero bytes 

Zero bytes not 



10.1 Opcodes Reserved for Digital 

Table 53 lists opcodes reserved for Digital. 

Table 53 Opcodes Reserved for Digital 



Mnemonic 


Opcode 


Mnemonic 


Opcode 


Mnemonic 


Opcode 


OPCOl 


01 


OPC05 


05 


OPCOB 


OB 


OPC02 


02 


OPC06 


06 


OPCOC 


OC 


OPC03 


03 


OPC07 


07 


OPCOD 


OD 


OPC04 


04 


OPCOA 


OA 


0PC14 


14 
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10.2 Opcodes Reserved for PALcode 

Table 54 lists the 21164-specific instructions. For nnore information, refer to 
the Alpha 21164 Microprocessor H ardware Rdterence M anual . 



Table 54 Opcodes Reserved for PALcode 



21164 
Mnemonic 



Opcode 



Architecture 
l\/lnemonic 



Function 



HW LD 



HW ST 



HW REI 



HW MFPR 



HW MTPR 



IB 



IF 



IE 



19 



ID 



PAL IB 



PAL IF 



PAL IE 



PAL 19 



PAL ID 



Performs Dstream load 
instructions. 

Performs Dstream store 
instructions. 

Returns instruction flow to the 
program counter (PC) pointed 
to by EXC_ADDR internal 
processor register (I PR). 

Accesses the I box, M box, and 
DcachelPRs. 

Accesses the I box, M box, and 
DcachelPRs. 



10.3 IEEE Floating-Point Instructions 

Table 55 lists the hexadecinnal value of the 11-bit function code field for the 
IEEE floating-point instructions, with and without qualifiers. The opcode for 
these instructions is 16i6. 

Table 55 IEEE Floating-Point Instruction Function Codes 



Mnemonic 


None 


/C 


/M 


/D 


/u 


/uc 


/UM 


/UD 


ADDS 


080 


000 


040 


oco 


180 


100 


140 


ICO 


ADDT 


OAO 


020 


060 


OEO 


lAO 


120 


160 


lEO 


CMPTEQ 


0A5 
















CMPTLT 


0A6 
















CMPTLE 


0A7 
















CMPTUN 


0A4 
















CVTQS 


OBC 


OBC 


07C 


OFC 










CVTQT 


OBE 


OBE 


07E 


OFE 










CVTTS 


OAC 


02C 


06C 


OEC 


lAC 


12C 


16C 
(continued 


lEC 
on next page) 
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Table 55 (Cont.) IEEE Floating-Point Instruction Function Codes 



Mnemonic 



None 



/C 



/M 



/D 



/U 



/UC 



/UIUI 



/UD 



DIVS 


083 


003 


043 


0C3 


183 


103 


143 


1C3 


DIVT 


0A3 


023 


063 


0E3 


1A3 


123 


163 


IE 3 


MULS 


082 


002 


042 


0C2 


182 


102 


142 


1C2 


MULT 


0A2 


022 


062 


0E2 


1A2 


122 


162 


IE 2 


SUBS 


081 


001 


041 


OCl 


181 


101 


141 


ICl 


SUBT 


OAl 


021 


061 


OEl 


lAl 


121 


161 


lEl 



Mnemonic 



/SU 



/sue /SUM /SUD /SU! 



/SUIC /SUIM /SUID 



ADDS 


580 


500 


540 


SCO 


780 


700 


740 


7C0 


ADDT 


SAO 


520 


560 


5E0 


7A0 


720 


760 


7E0 


CMPTEQ 


5A5 
















CMPTLT 


5A6 
















CMPTLE 


5A7 
















CMPTUN 


5A4 
















CVTQS 










7BC 


73C 


77C 


7FC 


CVTQT 










7BE 


73E 


77E 


7FE 


CVTTS 


SAC 


52C 


56C 


SEC 


7AC 


72C 


76C 


7EC 


DIVS 


583 


503 


543 


SC3 


783 


703 


743 


7C3 


DIVT 


5A3 


523 


563 


SE3 


7A3 


723 


763 


7E3 


MULS 


582 


502 


542 


SC2 


782 


702 


742 


7C2 


MULT 


5A2 


522 


562 


SE2 


7A2 


722 


762 


7E2 


SUBS 


581 


501 


541 


SCI 


781 


701 


741 


7C1 


SUBT 


5A1 


521 


561 


SEl 


7A1 


721 


761 


7E1 



Mnemonic 


None 


/S 














CVTST 


2AC 


6AC 














Mnemonic 


None 


/C 


/V 


/vc 


/SV 


/SVC 


/SVI 


/SVIC 


CVTTQ 


OAF 


02F 


lAF 


12F 


SAF 


S2F 


7AF 


72F 


Mnemonic 


D 


/VD 


/SVD 


/SVID 


/M 


/VM 


/SVM 


/SVIM 


CVTTQ 


OEF 


lEF 


SEF 


7EF 


06F 


16F 


S6F 


76F 
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Programming Note 

Because underflow cannot occur for CMPTxx, there is no difference 
in function or performance between CIMPTxx/S and CIMPTxx/SU. It is 
intended tinat software generate CIM PTxx/SU in place of CM PTxx/S. 

In the same manner, CVTQS and CVTQT can take an inexact result 
trap, but not an underflow. Because there is no encoding for a 
CVTQx/SI instruction, it is intended that software generate CVTQx/SU I 
in place of CVTQx/SI. 



10.4 VAX Floating-Point Instructions 

Table 56 lists the hexadecimal value of the 11-bit function code field for the 
VAX floating-point instructions. The opcode for these instructions is 15i6. 

Table 56 VAX Floating-Point Instruction Function Codes 



Mnemonic 


None 


/C 


/U 


/UC 


/S 


/SC 


/SU 


/sue 


ADDF 


080 


000 


180 


100 


480 


400 


580 


soo 


CVTDG 


09E 


OlE 


19E 


llE 


49E 


41E 


59E 


SIE 


ADDG 


OAO 


020 


lAO 


120 


4A0 


420 


SAO 


S20 


CMPGEQ 


0A5 








4A5 








CMPGLT 


0A6 








4A6 








CMPGLE 


0A7 








4A7 








CVTGF 


OAC 


02C 


lAC 


12C 


4AC 


42C 


SAC 


S2C 


CVTGD 


OAD 


02D 


IAD 


12D 


4AD 


42D 


SAD 


S2D 


CVTQF 


OBC 


OBC 














CVTQG 


OBE 


OBE 














DIVF 


083 


OOB 


183 


103 


483 


403 


S83 


SOB 


DIVG 


0A3 


023 


1A3 


123 


4A3 


423 


SAB 


S2B 


MULF 


082 


002 


182 


102 


482 


402 


S82 


S02 


MULG 


0A2 


022 


1A2 


122 


4A2 


422 


SA2 


S22 


SUBF 


081 


001 


181 


101 


481 


401 


S81 


SOI 


SUBG 


OAl 


021 


lAl 


121 


4A1 


421 


SAl 


S21 


Mnemonic 


None 


/C 


/V 


/vc 


/S 


/SC 


/sv 


/SVC 


CVTGQ 


OAF 


02F 


lAF 


12F 


4AF 


42F 


SAF 


S2F 
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10.5 Opcode Summary 

Table 57 lists all Alpha opcodes from 00 (CALL_PAL) through 3F (BGT). 
In the table, the column headings that appear over the instructions have a 
granularity of 8i6. The rows beneath the Offset column supply the individual 
hexadecimal number to resolve that granularity. 

If an instruction column has a in the right (low) hexadecimal digit, replace 
that with the number to the left of the backslash (\ ) in the Offset column 
on the instruction's row. If an instruction column has an 8 in the right (low) 
hexadecimal digit, replace that 8 with the number to the right of the backslash 
in the Offset column. 

For example, the third row (2/A) under the lOie column contains the symbol 
I NTS*, representing the all -integer shift instructions. The opcode for those 
instructions would then be 12i6 because the in 10 is replaced by the 2 in 
the Offset column. Likewise, the third row under the 18i6 column contains 
the symbol J SR*, representing all jump instructions. The opcode for those 
instructions is lA because the 8 in the heading is replaced by the number to 
the right of the backslash in the Offset column. 

The instruction format is listed under the instruction symbol. 
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Table 57 Opcode Summary 



Offset 



00 



08 



10 



18 



20 



28 



30 



38 



0/8 
1/9 
2/A 

3/B 
4/C 
5/D 

e/E 

7/F 



PAL* 
(pal) 

Res 
Res 
Res 
Res 
Res 
Res 
Res 



LDA 
(mem) 


INTA* 
(op) 


MISC* 
(mem) 


LDF 
(mem) 


LDL 

(mem) 


BR 
(br) 


BLBC 
(br) 


LDAH 
(mem) 


INTL* 
(op) 


\ PAL\ 


LDG 
(mem) 


LDQ 

(mem) 


FBEQ 
(br) 


BEQ 

(br) 


Res 


INTS* 
(op) 


JSR* 
(mem) 


LDS 
(mem) 


LDL_L 

(mem) 


FBLT 

(br) 


BLT 
(br) 


LDQ_U 

(mem) 


INTM* 
(op) 


\ PAL\ 


LDT 
(mem) 


LDO_L 

(mem) 


FBLE 
(br) 


BLE 

(br) 


Res 


Res 


Res 


STF 

(mem) 


STL 

(mem) 


BSR 

(br) 


BLBS 

(br) 


Res 


FLTV* 
(op) 


\ PAL\ 


STG 

(mem) 


STO 

(mem) 


FBNE 
(br) 


BNE 
(br) 


Res 


FLTI* 
(op) 


\ PAL\ 


STS 

(mem) 


STL_C 
(mem) 


FBGE 

(br) 


BGE 

(br) 


STO_U 
(mem) 


FLTL* 
(op) 


\ PAL\ 


STT 

(mem) 


STO_C 

(mem) 


FBGT 
(br) 


BGT 

(br) 



Symbol 


Meaning 


FLTI* 


IEEE floating-point instruction opcodes 


FLTL* 


Floating-point operate instruction opcodes 


FLTV* 


VAX floating-point instruction opcodes 


INTA* 


Integer arithmetic instruction opcodes 


INTL* 


Integer logical instruction opcodes 


INTM* 


Integer multiply instruction opcodes 


INTS* 


Integer shift instruction opcodes 


JSR* 


J ump instruction opcodes 


MISC* 


Miscellaneous instruction opcodes 


PAL* 


PALcode instruction (CALL PAL) opcodes 


\ PAL\ 


Reserved for PALcode 


Res 


Reserved for Digital 
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10.6 Required PALcode Function Codes 

Table 58 lists opcodes required for all Alpha implementations. The notation 
used is oo.ffff, where oo is the hexadecimal 6-bit opcode and ffff is the 
hexadecimal 26-bit function code. 

Table 58 Required PALcode Function Codes 
Mnemonic Type Function Code 



DRAINA 


Privileged 


00.0002 


HALT 


Privileged 


00.0000 


1MB 


Unprivileged 


00.0086 
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11 Electrical Data 

This chapter describes the electrical characteristics of the 21164 component 
and its interface pins. It is organized as follows: 

Electrical characteristics 

dc characteristics 

Clocking scheme 

ac characteristics 

Power supply considerations 

11.1 Electrical Characteristics 

Table 59 lists the maximum ratings for the 21164. 

Table 59 Alpha 21164 Absolute Maximum Ratings 
Characteristics Ratings 

Storage temperature -55°C to 125°C (-67°F to 257°F) 

J unction temperature 15°C to 90°C (59°F to 194°F) 

Supply voltage Vss -0.5 V, Vdd 3.6 V 

Input or output applied^ -0.5 V to 6.3 V 

Typical worst case power 

(^dd =3.3V 

Frequency =266 MHz 46 W 

Frequency =300 MHz 51 W 

Frequency =333 MHz 56 W 

iRefer to Section 11.5.2. 
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Caution 



Stress beyond the absolute maximum rating can cause permanent 
damageto the 21164. Exposure to absolute maximum rating conditions 
for extended periods of time can affect the 21164 reliability. 



11.2 dc Characteristics 

The 21164 is designed to run in a CMOS/TTL environment. The 21164 is 
tested and characterized in a CMOS environment. 

11.2.1 Power Supply 

The Vss pins are connected to 0.0 V, and the Vdd pins are connected to 3.3 V, 
±5%. 

11.2.2 Input Signal Pins 

Nearly all input signals are ordinary CMOS inputs with standard TTL 
levels (see Table 60). (See Section 11.3.1 for a description of an exception— 
osc_clk_in_h,l.) 

After power has been applied, input and bidirectional pins can be driven to a 
maximum dc voltage of 6.3 V (6.8 V for 1 ns) without harming the 21164. (It is 
not necessary to use static RAMs with 3.3-V outputs.) 

11.2.3 Output Signal Pins 

Output pins are ordinary 3.3-V CMOS outputs. Although output signals are 
rail-to-rail, timing is specified to ^^. 

Bidirectional pins are either input or output pins, depending on control timing. 
When functioning as output pins, they are ordinary 3.3-V CMOS outputs. 

Table 60 shows the CMOS dc input and output pins. 
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Table 60 


CMOS dc Input/Output Characteristics 








Parameter 


Requi 
IVIin. 


i rements 
IVIax. 


Units 




Symbol 


Description 


Test Conditions 


Vih 


High-level input voltage 


2.0 


— 


V 


— 


Vil 


Low-level input voltage 


— 


0.8 


V 


— 


Voh 


High-level output voltage 


2.4 


— 


V 


loh =-6.0 mA 


Vol 


Low-level output voltage 


— 


0.4 


V 


lol = 5.0 mA 


lil_pd 


Input with pull-down 
leakage current 


— 


±50 


/xA 


Vin =0V 


lih_pd 


Input with pull-down 
current 


— 


200 


/xA 


Vin =2.4V 


lil_pu 


1 nput with pull-up current 


— 


-800 


nA 


Vin =0.4V 


lihpu 


Input with pull-up leakage 
current 


— 


±50 


nA 


Vin =Vdd V 


lozlpd 


Output with pull-down 
leakage current (tristate) 


— 


±100 


nA 


Vin =0V 


1 ozhpd 


Output with pull-down 
current (tristate) 


— 


300 


nA 


Vin =2.4V 


lozlpu 


Output with pull-up current 
(tristate) 


— 


-800 


/xA 


Vin =0.4V 


1 ozhpu 


Output with pull-up 
leakage current (tristate) 


— 


±100 


nA 


Vin =Vdd V 


Idd 


Peak power supply current 


— 


18 


A 


Vdd = 3.465 V 

F requency = 266 MHz 


Idd 


Peak power supply current 


— 


20 


A 


Vdd = 3.465 V 

F requency = 300 MHz 


Idd 


Peak power supply current 


— 


22 


A 


Vdd = 3.465 V 
Frequency =333 MHz 



Most pins have low current pull -down devices to Vss. However, two pins have 
a pull-up device to Vdd. The pull-downs (or pull-ups) are always enabled. This 
means that some current will flow from the 21164 (if the pin has a pull-up 
device) or into the 21164 (if the pin has a pull-down device) even when the pin 
is in the high-impedance state. All pins have pull-down devices, except for the 
pins in the following table: 
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Signal Name Notes 



tms_h Has a pull-up device 

tdi_h Has a pull-up device 

osc_clk_in_h 50 n to Vterm (es^) (See Figure 69) 

osc_clk_in_l 50 n to Vterm {^^) (See Figure 69) 

tempsense 150 n to Vss 

11.3 Clocking Scheme 

The differential input clock signals osc_clk_in_h,l run at two times the 
internal frequency of the time base for the 21164. I nput clocks are divided by 
two onchip to generate a 50% duty cycle clock for internal distribution. The 
output signal cpu_clk_out_h toggles with an unspecified propagation delay 
relative to the transitions on osc_clk_in_h,l. 

System designers have a choice of two system clocking schemes to run the 
21164 synchronous to the system: 

1. The 21164 generates and drives out a system clock, sys_clk_outl_h,l. It 
runs synchronous to the internal clock at a selected ratio of the internal 
clock frequency. There is a small clock skew between the internal clock and 
sysci k_outl_h,l . 

2. The 21164 synchronizes to a system clock, ref_clk_in_h, supplied by 
the system. The ref_clk_in_h clock runs at a selected ratio of the 21164 
internal clock frequency. The internal clock is synchronized to the reference 
clock by an onchip digital phase-locked loop (DPLL). 

11.3.1 Input Clocks 

The differential input clocks osc_clk_in_h,l provide the time base for the 
chip when dc_ok_h is asserted. These pins are self-biasing, and must be 
capacitively coupled to the clock source on the module, or they can be directly 
driven. The terminations on these signals are designed to be compatible with 
system oscillators of arbitrary dc bias. The oscillator must have a duty cycle 
of 60°/cv'40% or tighter. Figure 69 shows the input network and the schematic 
equivalent of osc_clk_in_h,l terminations. 



Preliminary — Subject to Change — Juiy 1996 177 



Figure 69 osc_clk_in_h,l Input Network and Terminations 



Module Circuitry J "OnchiplDlrcuitry" 

osc_clkJn_h 1 6 nH 



_^3.3pF 



©50O 
Oscillator 



osc elk in I 



_^3.3pF 



Vdd 

o ° ^^\ 

130 n to 
600 n 



__3.3pF 



__3.3pF 



47 pF 



50 Q 



To 

Differential 

Amplifier 



6nH 



50 O 



Note: 

* Coupling Capacitors 47pF to 220 pF lj-04035.ai 

Ring Oscillator 

When signal dc_ok_h is deasserted, the clock outputs follow the internal ring 
oscillator. The 21164 runs off the ring oscillator, just as it would when an 
external clock is applied. The frequency of the ring oscillator varies from chip 
to chip within a range of 10 MHz to 100 MHz. This corresponds to an internal 
CPU clock frequency range of 5 MHz to 50 MHz. The system clock divisor is 
forced to 8, and the sys_clk_out2 delay is forced to 3. 

Clock Sniffer 

A special onchip circuit monitors the osc_clk_in pins and detects when input 
clocks are not present. When activated, this circuit switches the 21164 clock 
generator from the osc_clk_in pins to the internal ring oscillator. This 
happens independently of the state of the dc_ok_h pin. The dc_ok_h pin 
functions normally if clocks are present on the osc_clk_in pins. 
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11.3.2 Clock Termination and Impedance Levels 

In Figure 69, the clock is designed to approximate a 50-O termination for the 
purpose of impedance matching for those systems that drive input clocks across 
long traces. The clock input pins appear as a 50-O series termination resistor 
connected to a high impedance voltage source. The voltage source produces a 
nominal voltage value of ^^. The source has an impedance of between 130 O 
and 600 O. This voltage is called the self-bias voltage and sources current when 
the applied voltage at the clock input pins is less than the self-bias voltage. 
It sinks current when the applied voltage exceeds the self-bias voltage. This 
high impedance bias driver allows a clock source of arbitrary dc bias to be ac 
coupled to the 21164. The peak-to-peak amplitude of the clock source must 
be between 0.6 V and 3.0 V. Either a square-wave or a sinusoidal source may 
be used. Full-rail clocks may be driven by testers. I n any case, the oscillator 
should be ac coupled to the osc_clk_in_h,l inputs by 47 pF through 220 pF 
capacitors. 

11.3.3 ac Coupling 

Using series coupling (blocking) capacitors renders the 21164 clock input pins 
insensitive to the oscillator's dc level. When connected this way, oscillators 
with any dc offset relative to Vss can be used provided they can drive a signal 
into the osc_clk_in_h,l pins with a peak-to-peak level of at least 600 mV, but 
no greater than 3.0 V peak to peak. 

The value of the coupling capacitor is not overly critical. However, it should be 
sufficiently low impedance at the clock frequency so that the oscillator's output 
signal (when measured at the osc_clk_in_h,l pins) is not attenuated below 
the 600 mV peak-to-peak lower limit. For sine waves or oscillators producing 
nearly sinusoidal (pseu do square wave) outputs, 220 pF is recommended at 
533.3 MHz (266.6 MHz x 2). A high quality dielectric such as NPO is required 
to avoid dielectric losses. 

Table 61 shows the input clock specification. 
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Table 61 Input Clock Specification 



Signal Parameter Minimum lUlaximum Unit 



oscclkjnhj symmetry 
oscclkjnhj voltage 
oscclkjnh,! Z input 

Tfreq (CPU clock frequency) 
Tcycle(^) 



40 


60 




% 


0.6 


3.0 




V (peak-to-peak) 


Refer to Figu 


re 70, 


Clock 1 


nput Differential 


1 mpedance. 








100 


3331 




MHz 


3^ 


10 




ns 



^Maximum CPU dock frequency is either 333 IMHz, 300 JVJHz, or 266 JVJHz, depending upon part 
variation. 
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Figure 70 Clock Input Differential Impedance 
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11.4 ac Characteristics 

This section describes the ac timing specifications for the 21164. 

11.4.1 Test Configuration 

All input timing is specified relative to the crossing of standard TTL input 
levels of 0.8 V and 2.0 V. Output timing is to the nominal CMOS switch point 
of ^ (see Figure 71). 

Figure 71 Input/Output Pin Timing 
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Because the speed and complexity of microprocessors lias increased 
substantially over the years, it is necessary to change the way they are 
tested. Traditional assumptions that all loads can be lumped into some 
accumulation of capacitance cannot be employed any more. Rather, the model 
of a transmission line with discrete loads is a much more realistic approach for 
current test technology. 

Typically, printed circuit board (PCB) etch has a characteristic impedance 
of approximately 75 O. This may vary from 60 O to90 O with tolerances. 
If the line is driven in the electrical center, the load could be as low as 
30 O. Therefore, a characteristic impedance range of 30 O to90 O could be 
experienced. 

The 21164 output drivers are designed with typical printed circuit board 
applications in mind rather than trying to accommodate a 40-pF test load 
specification. As such, it "launches" a voltage step into a characteristic 
impedance, ranging from 30 O to 90 O. 

To prevent signal quality problems due to overshoot or ringing, "near end" 
terminated transmission line design rules are used. By combining the source 
impedance of the driver transistors with an additional 20-O onchip resistor, a 
source impedance of approximately 40 O is achieved. Additionally, a load value 
of 10 pF, when added to the PCB etch delays, provides a realistic estimate of 
actual system timing. When employing this test configuration, the signal at 
the end of the line will transition cleanly through theTTL input specification 
range of 0.8 V to 2.0 V without plateaus, or reversal into the range. 

11.4.2 Pin Timing 

The following sections describe Bcache loop timing, sys_clk-based system 
timing, and reference clock-based system timing. 

Backup Cache Loop Timing 

The 21164 can be configured to support an optional offchip backup cache 
(Bcache). Private Bcache read or write (Scache victims) transactions initiated 
by the 21164 are independent of the system clocking scheme. Bcache loop 
timing must bean integer multiple of the 21164 cycle time. 

Table 62 lists the Bcache loop timing. 
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Table 62 Bcache Loop Timing 



Signal 


Specification 


Value 


Name 


data_h<127:0> 


Input setup 


1.1ns 


Tdsu 


data_h<127:0> 


Input hold 


0.0 ns 


Tdh 


index_h<25:4> 


Output delay 


Tdd +0.4nsi 


Tiod 


index_h<25:4> 


Output hold time 


Tmdd 


Tioh 


data_h<127:0> 


Output delay 


Tdd +Tcycle +0.4 ns^ 


Tdod 


data_h<127:0> 


Output hold 


Tmdd +Tcycle 


Tdoh 


^The value 0.4 ns accoL 


intsfor onchip driver a 


nd dock skew. 





Outgoing Bcache index and data signals are driven off the internal clock edge 
and the incoming Bcache tag and data signals are latched on the same internal 
clock edge. Table 63 shows the output driver characteristics. 



Table 63 Output Driver Characteristics 



Specification 



40-pF Load 



10-pF Load 



Name 



Maximum driver delay 
M inimum driver delay 



2.6 ns 

1.0 ns 



1.5 ns 
1.0 ns 



Tdd 
Tmdd 



Output pin timing is specified for lumped 40-pF and 10-pF loads. In some 
cases, the circuit may have loads higher than 40 pF. The 21164 can safely drive 
higher loads provided the average charging or discharging current from each 
pin is 10 mA or less. The following equation can be used to determine the 
maximum capacitance that can be safely driven by each pin: 

Cmax (in pF) = 3t, where t is the waveform period (measured from rising 
to rising or falling to falling edge), in nanoseconds. 

For example, if the waveform appearing on a given I/O pin has a 20.4-ns 
period, it can safely drive up to and including 61 pF. 

Figure 72 shows the Bcache read and write timing. 
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Figure 72 Bcache Timing 
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sys_clk-Based Systems 

All timing is specified relative to tiie rising edge of the internal CPU clock. 

Table 64 shows 21164 system clock sys_clk_outl_h,l output timing. Setup 
and hold times are specified independent of the relative capacitive loading of 
sys_clk_outl_h,l, addr_h<39:4>, data_h<127:0>, and cmd_h<3:0> signals. 
The ref_clk_in_h signal must be tied to Vdd for proper operation. 
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Table 64 Alpha 21164 System Clock Output Timing (sysclksTg) 



Signal 


Specification 




Value 


Name 


sys_clk_outl_h,l 


Output delay 




Tdd 


Tsysd 


sys_clk_outl_h,l 


Minimum output delay 


Tmdd 


Tsysdm 


data bus req h, 

data_h<127:0>, 

addr_h<39:4> 


Input setup 




1.1ns 


Tdsu 


data bus req h, 
data h<127:0>, 
addr_h<39:4> 


Input hdd 




ns 


Tdh 


addr_h<39:4> 


Output delay 




Tdd +0.4nsi 


Taod 


addr_h<39:4> 


Output hold time 




Tmdd 


Taoh 


data_h<127:0> 


Output delay 




Tdd +Tcycle +0.4ns^ 


Tdod^ 


data_h<127:0> 


Output hold time 




Tmdd +Tcycle^ 


Tdoh^ 




Non-Pipe_ 


_Latch Mode 




addr_bus_req_h 


Input setup 




3.8 ns 


Tabrsu 


addr_bus_req_h 


Input hold 




-1.0 ns 


Tabrh 


dack_h 


Input setup 




3.4 ns 


Tntacksu 


cack_h 


Input setup 




3.7 ns 


Tntcacksu 


cack, dack 


Input hdd 




-1.0 ns 


Tntackh 


Pipe_Latch Mode^ 


addr_bus_req_h, 
cackh, dackh 


Input setup 




1.1ns 


Ttacksu 


addr_bus_req_h, 
cackh, dackh 


Input hold 




ns 


Ttackh 



^The value 0.4 ns accounts for onchip driver and dock skew. 

^For all write transactions initiated by the 21164, data is driven one CPU cycle after the 
sys_clk_outl or index_h'^5:4>pins. 

^1 n pipe_latch mode, control signals are piped onchip for one sys_clk_outl_h,l before usage. 



Figure 73 shows sys_clk system timing. 
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Figure 73 sys_clk System Timing 
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Reference Clock-Based Systems 

Systems that generate their own system clock expect the 21164 to synchronize 
its sys_clk_outl_h,l outputs to their system clocl<. The 21164 uses a digital 
phase-locked loop (DPLL) to synchronize its sys_clk_outl signals to the 
system clock that is applied to the ref_clk_in_h signal. 

Table 65 shows all timing relative to the rising edge of ref_clk_in_h. 



Table 65 Alpha 21164 Reference Clock Input Timing 



Signal 


Specification 


Value 


Name 


data_bus_req_h, 

data_h<127:0>, 

addr_h<39:4> 


Input setup 


1.1ns 


Tdsu 


data bus req h, 
data h<127:0>, 
addr_h<39:4> 


Input hold 


0.5 X Tcycle 


Troh 


addr_h<39:4> 


Output delay 


Tdd + 0.5 X Tcycle + 0.9 ns^ 


Traod 


addr_h<39:4> 


Output hold 
time 


Tmdd 


Traoh 


data_h<127:0> 


Output delay 


Tdd + 1.5 X Tcycle + 0.9 ns^ 


Trdod^ 


data_h<127:0> 


Output hold 
time 


Tmdd + Tcycle 


Trdoh^ 




Non- 


■Pipe_Latch Mode 




addr_bus_req_h 


Input setup 


3.8 ns 


Tntrabrsu 


addr_bus_req_h 


Input hdd 


0.5 X Tcycle 


Tntrabrh 


dack_h 


Input setup 


3.3 ns 


Tntracksu 


cackh 


Input setup 


3.7 ns 


Tntrcacksu 


cackh, dackh 


Input hold 


(0.5 X Tcycle) 


Tntrackh 



■^The value 0.9 ns accounts for onchip skews that include 0.4 ns for driver and clocl< skew, phase 
detector skews due to circuit delay (0.2 ns), and delay in ref_clic_ln_h due to the package (0.3 ns). 

^For all write transactions initiated by the 21164, data is driven one CPU cycle later. 

(continued on next page) 
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Table 65 (Cont.) 


Alpha 21164 Reference Clock Input Timing 




Signal 


Specification Value 


Name 


Pipe_Latch Mode^ 



addr_bus_req_h, Input setup 1.1ns 

cackh, dackh 

addr_bus_req_h, Input hold O.SxTcycle 

cack h, dack h 



Ttracksu 
Ttrackh 



^1 n pipejatch mode, control signals are piped onchip for one sys_clk_outl_h,l before usage. 

11.4.3 Digital Phase-Locked Loop 

Figure 74 and Table 66 describe the digital pinase-locked loop (DPLL) stages of 
operation. 

Figure 74 ref_clk System Timing 
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Table 66 ref_clk System Timing Stages 



Stage Description 



1 The internal CPU clock rising edge coincides with the rising edge of 
refclkjnh. 

2 The DPLL causes the internal CPU clock to stretch for one phase (1 cycle 
of oscclkjnh,!). 

3 The stretch causes ref_clk_in_h to lead the internal CPU clock by one 
phase. 

4 The CPU clock is always slightly faster than the external ref_clk_in_h 
and gains on ref_clk_in_h over time. Eventually the gain equals one 
phase and a new stretch phase follows. 

Although systems that supply a ref_clk_in_h do not use sys_clk_outl_h,l, 

a relationship between the two signals exists, just as in the sys_clk-based 
systems, because the 21164 uses sys_clk_outl_h,l internally to determine 
timing during system transactions. 

11.4.4 Timing — Additional Signals 

This section lists timing for all other signals. 

Asynchronous Input Signals 

The following is a list of the asynchronous input signals: 

clk_mode_h dc_ok_h ref_clk_in_h 

sys_reset_l-'^ 

perf_mon_h^ 

irq_h<3:0>2 mch_hlt_irq_h2 pwr_fail_irq_h2 sys_mch_chk_irq_h2 

■"^Signal sys_reset_l may be deasserted synchronously. 
^These signals can also be used synchronously. 

Miscellaneous Signals 

Table 67 and Table 68 list the timing for miscellaneous input-only and output- 
only signals. All timing is expressed in nanoseconds. 
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Table 67 


Input Timing for sys_clk_out- or ref_clk_in-Based Systems 




Value Name 


Signal 


Specification sys_clk_out ref_clk_in sys_clk_out ref_clk_in 



cfail_h, fill_h, fill_error_h, fill_id_h, 
fill_nocheck_h, idle_bc_h, shared_h, 
system_lock_f1ag_h 

irq_h<3:0>, mch_hlt_irq_h, pwr_ 
fail_irq_h, sys_mch_chk_irq_h 

Testability pins: 
port_mode_h, srom_data_h, 
srom_present_l 



Input setup 1.1ns 



1.1 ns 



Tdsu 



Tdsu 



cfail_h, fill_h, fill_error_h, fill_id_h, 
fill_nocheck_h, idle_bc_h, shared_h, 
system_lock_f1ag_h 

irq_h<3:0>, mch_hlt_irq_h, pwr_ 
fail_irq_h, sys_mch_chk_irq_h 



Input hold 



ns 



0.5*Tcycle Tdh 



Troh 



sys_reset_l 

Testability pins: 
port_mode_h, srom_data_h, 
srom_present_l 



Table 68 


Output Timing for 


sys. 


_clk 


_out- 


or ref_clk_ 


Jn-Based Systems 






Specification 






Clocking System Value Clocking System Name 


Signal 


sys_ 


.clk_( 


3Ut 


ref_clk_ 


in sys_clk_out 


ref_clk_in 


Unidirectional Signals 



addr_res_h, 

int4_valid_h,i 

scache_set_h, 

srom_clk_h, 

srom_oe_l, 

victim_pending_h 



Output 
delay 



Tdd+0.4 ns 



Tdd-tO.5*Tcycle+0.9 ns Taod 



Traod 



■"^Read transaction 



(continua:! on next page) 
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Table 68 (Cont.) Output Timing for sys_clk_out- or ref_clk_in-Based Systems 



addr res h, 


Output 


int4 valid h} 


hold 


scache set h, 




srom elk h, 




srom oe 1, 




victi mpendi ngh 




int4 valid h^ 


Output 




delay 


int4 valid h^ 


Output 




hold 



Tmdd 



Tmdd 



Clocking System Value 


Clocking System Name 


Signal Specification sys_clk_out ref_clk_in 


sys_clk_out ref_clk_in 


Unidirectional Signals 



Taoh 



Traoh 



Tdd+Tcycle+0.4 ns Tdd+1.5*Tcycle+0.9 ns Tdod 
Tmdd+Tcycle Tmdd+Tcycle Tdoh 



Trdod 
Trdoh 



Bidirectional Signals 



I nput mode: 

addr_cmd_par_h, I nput setup 1.1 ns 

cmd_h, 

data_check_h,-'^ 

tag_ctl_par_h,^ 

tag_dirty_h,^ 

tag_shared_h^ 

addr_cmd_par_h, Input hold ns 

cmd_h, 

data_check_h,i 

tag_ctl_par_h,^ 

tag_dirty_h,3 

tag_shared_h^ 



1.1 ns 



Tdsu 



Tdsu 



0.5*Tcycle 



Tdh 



Tsdadh 



■"^Read transaction 
^Write transaction 
^Fills from memory 



(continued on next page) 
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Table 68 (Cont.) Output Timing for sys_clk_out- or ref_clk_in-Based Systems 



Clocking System Value 


Clocking System Name 


Signal Specification sys_clk_out ref_clk_in 


sys_clk_out ref_clk_in 


Bidirectional Signals 



Output mode: 
addr_cmd_par_h, 

cmd_h, 

tag_ctl_par_h,'' 
tagdirtyh,'* 
tag_shared h,^ 
tagvalidj? 

data_check_h^ 

addr_cmd_par_h, 

cmd_h, 

tag_ctl_par_h,^ 
tagdirtyh,'^ 
tag_shared h,"^ 
tagvalidj? 

data check h^ 



Output 
delay 



Tdd+0.4 ns 



Tdd+0.5*Tcycle+0.9 ns Taod 



Traod 



Output 
delay 

Output 
hold 



Tdd+Tcycle-K).4 ns Tdd+1.5*Tcycle-K).9 ns Tdod 
Tmdd Tmdd Taoh 



Trdod 
Traoh 



Output 
hold 



Tmdd+Tcycle Tmdd+Tcycle 



Tdoh 



Trdoh 



^Write transaction 

*Only for write broadcasts and system transactions 
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Signals in Table 69 are used to control Bcache data transfers. These signals 
are driven off the CPU clock. The choice of sys_clk_out or ref_clk_in has no 
impact on the timing of these signals. 

Table 69 Bcache Control Signal Timing 



Signal 


Specification 


Value 


Name 


Input mode: 








tag data h, tag data par h, 


Input setup 


1.1ns 


Tdsu 


tag_valid_h 








tag data h, tag data par h, 


Input hold 


ns 


Tdh 


tag_valid_h 








Output mode: 








data ram oe h, data ram we h/ 


Output delay 


Tdd-KD.4ns 


Taod 


tagramoeh, tagramweh^ 








tag data h, tag data par h, 


Output delay 


Tdd-KD.4ns 


Taod 


tagvalidh 








data ram oe h, data ram we h/ 


Output hold 


Tmdd 


Taoh 


tagramoeh, tagramweh'" 








tag data h, tag data par h, 


Output hold 


Tmdd 


Taoh 


tagvalidh 








^Pulse width for this signal is controlled thi 


rough the BC_CONFIG IPR. 





11.4.5 Timing of Test Features 

Timing of 21164 testability features depends on the system clock rate and the 
test port's operating mode. This section provides timing information that may 
be needed for most common operations. 

11.4.6 Icache BiSt Operation Timing 

The Icache BiSt is invoked by deasserting the external reset signal 
sys_reset_l. Figure 75 shows the timing between various events relevant 
to BiSt operations. 
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Figure 75 BiSt Timing Event-Time Line 
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The timing for deassertion of internal reset (time t2, see asterisk) is valid 
only if an SROM is not present (indicated by keeping signal srom_present_l 
deasserted). If an SROM is present, the SROM load is performed once the 
BiSt completes. The internal reset signal T%Z_RESET_B_L is extended until 
the end of the SROM load (Section 11.4.7). I n this case, the end of the time 
line shown in Figure 75 connects to the beginning of the time line shown in 
Figure 76. 

Table 70 and Table 71 list timing shown in Figure 75 for some of the system 
clock ratios. Time ti is measured starting from the rising edge of sysclk 
following the deassertion of the sys_reset_l signal. 

Table 70 BiSt Timing for Some System Clock Ratios, Port Mode=Normal 
(System Cycles) 



Sysclk 


System Cycles 




Ratio 


<i 


<2 


<3 


3 


8 


22644+21/2 


22645 


4 


7 


19721+21/2 


19722 


15 


7 


13291+141/2 


13292 



Preliminary — Subject to Change — Juiy 1996 195 



Table 71 BiSt Tinning for Some System Cloclt Ratios, Port l\/lode=Normal 
(CPU Cycles) 



Sysclk 
Ratio 



CPU Cycles 



<3 



3 


24 


679341/2 


67935 


4 


28 


788861/2 


78888 


15 


105 


1993791/2 


199380 



11.4.7 Automatic SROM Load Timing 

The SROM load is triggered by the conclusion of BiSt if srom_present_l is 
asserted. The SROM load occurs at the internal cycle time of approximately 
126 CPU cycles for srom_clk_h, but the behavior at the pins may shift slightly. 

Timing events are shown in Figure 76 and are listed in Table 72 and Table 73. 



Figure 76 SROM Load Timing Event-Time Line 
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Table 72 SROM Load Timing for Some System Clock Ratios (System Cycles) 

Sysclk System Cycles^ 

Ratio ii t2 is i4 ts 



3 


4 


22 


4408090 


44082164% 


4408217 


4 


3 


48 


3306099 


3306193+21/2 


3306194 


15 


3 


13 


881627 


8816514^1/2 


881652 



^Measured in sysclk cycles, where -Hi refers to an additional n CPU cycles. 
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Table 73 SROM Load Timing for Some System Clock Ratios (CPU Cycles) 



Sysclk CPU Cycles 
Ratio ii t2 



tz 



U 



<5 



3 


12 


4 


12 


15 


45 



66 13224270 1322464872 13224651 

192 13224396 13224774y2 13224776 

195 13224405 1322477472 13224780 



Figure 77 is a timing diagram of an SROIM load sequence. 



Figure 77 Serial ROM Load Timing 
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The minimum srom_clk_h cycle = (126 - syscll< ratio) * (CPU cycle time). 

The maximum srom_clk_h to srom_data_h delay allowable (in order to meet 
the required setup time) =[126 - (5 * sysclk ratio)] * (CPU cycle time). 

11.4.8 Clock Test l\/lodes 

This section describes the 21164 clock test modes. 

11.4.9 Normal l\/lode 

When the clk_mode_h<l:0> signals are not asserted, the osc_clk_in_h,l 

frequency is divided by 2. This is the normal operational mode of the clock 
circuitry. 
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11.4.10 Chip Test Mode 

To lower the maximum frequency that the chip manufacturing tester is 
required to supply, a divide-by-1 mode has been designed into the clock 
generator circuitry. When the clk_mode_h<0> signal is asserted and 
clk_nnode_h<l> is not asserted, the clocl< frequency that is applied to the 
input clock signals osc_clk_in_h,l bypasses the clock divider and is sent to 
the chip clock driver. This allows the chip internal circuitry to be tested at full 
speed with a one-half frequency osc_clk_in_h,l. 

11.4.11 Module Test Mode 

When the clk_mode_h<0> signal is not asserted and clk_mode_h<l> 

is asserted, the clock frequency that is applied to the input clock signals 
osc_clk_in_h,l is divided by 4 and is sent to the chip clock driver. The digital 
phase-locked loop (DPLL) continues to keep the onchip sys_clk_outl_h,l 
locked to ref_clk_in_h within the normal limits if a ref_clk_in_h signal is 
applied (0 ns to 1 osc_clk_in_h,l cycle after ref_clk_in_h). 

11.4.12 Clock Test Reset Mode 

When both the clk_mode_h<0>and the clk_mode_h<l> signals are asserted, 
the sys_clk_out generator circuit is forced to reset to a known state. This 
allows the chip manufacturing tester to synchronize the chip to the tester 
cycle. Table 74 lists the test modes. 

Table 74 Test Modes 



Mode 


elk. 


_mode_ 


.h<0> 


clk_mode_h<1> 


N ormal 












Chip test 


1 









Module test 









1 


Clock reset 


1 






1 



11.4.13 IEEE 1149.1 (JTAG) Performance 

Table 75 lists the standard mandated performance specifications for the 
IEEE 1149.1 circuits. 
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Table 75 IEEE 1149.1 Circuit Performance Specifications 

Item Specification 

trstj is asynchronous. Minimum pulse width. 4 ns 

trst_l setup time for deassertion before a transition on 4 ns 

tck_h. 

jviaximum acceptable tck_h clock frequency. 16.6 MHz 

tdi_h/tms_h setup time (referenced totck_h rising edge). 4 ns 

tdi_h/tms_h hold time (referenced totck_h rising edge). 4 ns 

Maximum propagation delay at pin tdo_h (referenced to 14 ns 
tck_h falling edge). 

Maximum propagation delay at system output pins 20 ns 

(referenced totck_h falling edge). 

11.5 Power Supply Considerations 

For correct operation of the 21164, all of the Vss pins must be connected to 
ground and all of the Vdd pins must be connected to a 3.3 V ±5% power source. 
This source voltage should be guaranteed (even under transient conditions) at 
the 21164 pins, and not just at the PCB edge. 

Plus 5 V is not used in the 21164. The voltage difference between the Vdd 
pins and Vss pins must never be greater than 3.6 V. If the differential exceeds 
this limit, the 21164 chip will be damaged. 

11.5.1 Decoupling 

The effectiveness of decoupling capacitors depends on the amount of inductance 
placed in series with them. The inductance depends both on the capacitor style 
(construction) and on the module design. I n general, the use of small, high 
frequency capacitors placed close to the chip package's power and ground pins 
with very short module etch will give best results. Depending on the user's 
power supply and power supply distribution system, bulk decoupling may also 
be required on the module. 

Each individual case must be separately analyzed, but generally designers 
should plan to use at least 6 fiF of capacitance. Typically, 40 to 60 small, 
high frequency 0.1-fif capacitors are placed near the chip's Vdd and Vss pins. 
Actually placing the capacitors in the pin field is the best approach. Several 
tens of fiF of bulk decoupling (comprised of tantalum and ceramic capacitors) 
should be positioned near the 21164 chip. 
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Use capacitors that are as physically small as possible. Connect the capacitors 
directly to the 21164 Vdd and Vss pins by short (0.64 cm [0.25 in] or 
less) surface etch. The small capacitors generally have better electrical 
characteristics than the larger units, and will more readily fit close to the 
I PGA pin field. 

11.5.2 Power Supply Sequencing 

Although the 21164 uses a 3.3-V (nominal) power source, most of the other 
logic on the PCB probably requires a 5-V power supply. These 5-V devices can 
damage the 21164's I/O circuits if the 5-V power source powering the PCB logic 
and the 3.3-V (Vdd) supply feeding the 21164 are not sequenced correctly. 

Caution 



To avoid damaging the 21164's I/O circuits, the I/O pin voltages must 
not exceed 4.0 V until the Vdd supply is at least 3.0 V or greater. 



This rule can be satisfied if the Vdd and the 5-V supplies come up together, 
or if the Vdd supply comes up before the 5-V supply is asserted. Bringing 
the lower voltage up before the higher voltage is the opposite of the way that 
CMOS systems with multiple power supplies of different voltages are usually 
sequenced, but it is required for the 21164. 

A three-terminal voltage regulator can be used to make 3.3-V Vdd from the 
5-V supply, provided the output of the regulator (Vdd) tracks the 5-V supply 
with only a small offset. The requirement is that when the 5-V supply reaches 
4.0 V, Vdd must be 3.0 V or higher. While the 5-V supply is below 4.0 V, Vdd 
can be less than 3.0 V. 

All 5-V sources on the 21164's I/O pins should be disabled if the power supply 
sequencing is such that the 5-V supply will exceed 4.0 V before Vdd is at least 
3.0 V. The 5-V sources should remain disabled until the Vdd power supply is 
equal to or greater than 3.0 V. 

Disabling all 5-V sources can be very difficult because there are so many 
possible sneak paths. I nputs, for example, on bipolar TTL logic can be a source 
of current, and will put a voltage across a 21164 I/O pin high enough to violate 
the (no higher than 4.0 V until there is 3.0 V) rule. TTL outputs are specified 
to drive a logic one to at least 2.4 V, but usually drive voltages much higher. 
CMOS logic and CMOS SRAMs usually drive "full rail" signals that match the 
value of the 5-V power supply. 
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Another concern is parallel (dc) terminations or pull-ups connected between 
the 21164 and the 5-V supply. The 3.3 V (Vdd) supply should be used to power 
parallel terminations. 

Disabling the non-21164 5-V outputs of PCB logic is generally possible, but 
raises the PCB complexity and can reduce system performance by increasing 
critical path timing. If the 5-V logic device has an enable pin, circuits (such 
as power supply supervisor chips) on the PCB can monitor the Vdd and 5-V 
supplies. When the supervision circuit detects that 5.0 V is increasing from 
zero while the Vdd supply is below 3.0 V, the power supply supervisor circuit 
produces a disable signal to force all PCB logic with 5-V outputs into the high 
impedance state. This technique will not prevent bipolar TTL inputs from 
acting as a 5-V source, but it can be used to disable sources such as cache RAM 
outputs. 
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12 Thermal Management 

This section describes tine 21164 tinermai management and tinermal design 
considerations. 

12.1 Operating Temperature 

The 21164 is specified to operate when the temperature at the center of the 
heat sink (Tc) is no higher than 72°C (266 IVJHz), 70°C (300 IviHz), or 68°C 
(333 IMHz). Temperature (Tc) should be measured at the center of the heat sinl< 
(between the two pacl<age studs). The GRAF 01 L pad is the interface material 
between the package and the heat sink. 

Table 76 lists the values for the center of heat-sink-to-ambient (Sea) for the 
499-pin grid array. Table 77 shows the allowable Ta (without exceeding Tc) at 
various airflows. 

Note 



Digital recommends using the heat sink because it greatly improves 
the ambient temperature requirement. 



Table 76 Oca at Various Airflows 



Airflow (linear ft/min) 100 


200 


400 


600 


800 


1000 


Frequency: 266, 300, and 333 MHz 


6ca with heat sink 1 (°C/W) 2.30 
6ca with heat sink 2 (°C/W) 1.25 


1.30 
0.75 


0.70 
0.48 


0.53 
0.40 


0.45 
0.35 


0.41 
0.32 
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Table 77 Maximum 7a at Various Airflows 



Airflow (linear ft/min) 


100 


200 


400 


600 


800 


1000 


Frequency: 266 MHz, Power: 


46 W @Vdd ; 


= 3.3V 










Ta with lieatsink 1 (°C) 
Ta with heatsinl< 2 (°C) 


14.5 


37.5 


39.8 
49.9 


47.6 
53.6 


51.3 
55.9 


53.2 
57.3 


Frequency: 300 MHz, Power: 


51 W @Vdd ; 


= 3.3V 










Ta with heat sinl< 1 (°C) 
Ta with heatsinl< 2 (°C) 


— 


31.8 


34.3 
45.5 


43.0 
49.6 


47.1 
52.2 


49.1 
53.7 


Frequency: 333 MHz, Power: 


56 W @Vdd ; 


= 3.3 V 










Ta with heat sinl< 1 (°C) 
Ta with heat sinl< 2 (°C) 


— 


26.0 


28.8 
41.1 


38.3 
45.6 


42.8 
48.4 


45.0 
46.2 
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12.2 Heat Sink Specifications 

Two heat sinks are specified. Heat sink type 1 mounting holes are in line with 
the cooling fins. Heat sink type 2 mounting holes are rotated 90° from the 
cooling fins. The heat sink composition is aluminum alloy 6063. Type 1 heat 
sink is shown in Figure 78, and type 2 heat sink is shown in Figure 79, along 
with their approximate dimensions. 

Figure 78 Type 1 Heat Sink 



6.57 cm 
(2.585 in) 



6.57 cm 
(2.585 in) 



■I 



M 



M 



3.81 cm 

(1.5 in) 

sq. 



2.54 cm 
(1.0 in) 



3.25 cm 
(1.280 in) 
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Figure 79 Type 2 Heat Sink 




3.80 cm 
(1.495 in) 



2.54 cm 
(1.0 in) 



IT 



4.45 cm 
(1.75 in) 



3.81 cm 
(1.5 in) 



12.3 Thermal Design Considerations 

Follow these guidelines for printed circuit board (PCB) component placement: 

• Orient the 21164 on the PCB with the heat sink fins aligned with the 
airflow direction. 

• Avoid preheating ambient air. Place the 21164 on the PCB so that inlet air 
is not preheated by any other PCB components. 

• Do not place other high power devices in the vicinity of the 21164. 

• Do not restrict the airflow across the 21164 heat sink. Placement of other 
devices must allow for maximum system airflow in order to maximize the 
performance of the heat sink. 
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13 Mechanical Specifications 

This section sinows tine 21164 mechanical package dimensions without a heat 
sink. For heat sink information and dimensions, refer to Section 12. 

Package Dimensions 

Figure 80 shows the package physical dimensions without a heat sink. 
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Figure 80 Package Dimensions 
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16 18 20222426 28 30 



foY#PT*M9MW 



32 34 36 38 40 



1.27 mm (0.050 in) Typ 
4.32 mm (0.170 in) Typ— » 

2.54 mm (0.100 in) Typ 
-Standoff (4x) 

^499x 1.40 mm (0.055 in) Typ 
^1.27 mm (0.050 in) Typ 



23 25 27 29 31 33 35 37 39 41 43 

_26.67 mm 

(1.050 in) ^^ 



57.40 mm (2.260 in) Typ- 



28.70 mm 
(1.130 in) Typ 



D 
D 
D 



D 
D 
D 



25.40 mm 
(1.000 in) Typ 



a 
a 
a 



a 
a 



t 



Lid 



26.67 mm 
(1.050 in) 



0.46 mm _J 
(0.018 in) Typ 



0.13 mm _ 
(0.005 in) R 



2.69 mm (0.106 in) Typ 



-1.27 mm 
(0.050 in) Typ 



1/4-20 
Stud (2x) 



- 7.62 mm 
(0.300 in) Typ 



28.70 mm 
(1.130 in) Typ 



Capacitors (12x) 



38.10 mm 
(1.500 in) Typ 



LJ-03457-TI0 



Preliminary— Subject to Cliange—Juiy 1996 207 



Technical Support and Ordering Information 

Technical Support 

If you need technical support or help deciding which literature best meets your 
needs, call the Digital Semiconductor Information Line: 

United States and Canada 1-800-332-2717 
Outside North America +1-508-628-4760 

Ordering Digital Semiconductor Products 

To order Alpha 21164 microprocessor evaluation boards and motherboards, 
contact your local distributor. 

You can order the following semiconductor products from Digital: 

Product Order Number 

Alpha 21164 333-MHz Microprocessor 21164-333 

Alpha 21164 300-MHz Microprocessor 21164-300 

Alpha 21164 300-MHz Microprocessor for Windows NT 21164- P 2 

Alpha 21164 266-MHz Microprocessor 21164-266 

Alpha 21164 266-MHz Microprocessor for Windows NT 21164- PI 

Alpha 21164 Microprocessor Evaluation Board 266 MHz 21A04-01 
Kit (Supports Digital UNIX, OpenVMS, and Windows NT 
operating systems.) 

Alpha 21164 Microprocessor Motherboard 266-MHz Kit 21A04-A0 

(Supports the Windows NT operating system.) 

Ordering Digital Semiconductor Sample Kits 

To order an Alpha 21164 Microprocessor Sample Kit, which contains one 
Alpha 21164 microprocessor, one heat sinl<, and supporting documentation, 
call 1-800-DIGITAL. You will need a purchase order number or credit card to 
order the following products: 

Product Order Number 

Alpha 21164-266 Sample Kit 21164- SA 



Ordering Associated Literature 

The following table lists some of the available Digital Semiconductor literature. 
For a complete list, contact the Digital Semiconductor Information Line. 



Title 



Order Number 



Alpha Architecture Reference Manual^ 

Alpha AXP Architecture Handbook 

Alpha 21164 Microprocessor Hardware Reference Manual 

Alpha 21164 Microprocessor Product Brief 

Alpha 21164 Evaluation Board Read Me First 

Alpha 21164 Evaluation Board Product Brief 

Alpha 21164 Evaluation Board User's Guide 

Alpha 21164 Microprocessor Motherboard Product Brief 

Alpha 21164 Microprocessor Motherboard User's Manual 

DECchip 21171 Core Logic Chipset Product Brief 

DECchip 21171 Core Logic Chipset Technical Reference 
Manual 

Answers to Common Questions about PALcodefor Alpha 
AXP Systems 

PALcodefor Alpha Microprocessors System Design Guide 

Alpha Microprocessors Evaluation Board Windows NT 
3.51 Installation Guide 

SPICE Models for Alpha Microprocessors and Peripheral 
Chips: An Application Note 

Alpha Microprocessors SROM Mini-Debugger User's 
Guide 

Alpha Microprocessors Evaluation Board Debug Monitor 
User's Guide 

Alpha Microprocessors Evaluation Board Software Design 
Tools User's Guide 



EY-L520E-DP-YCH 

EC-QD2KA-TE 

EC-QAEQC-TE 

EC-QAENB-TE 

EC-QD2VB-TE 

EC-QCZZD-TE 

EC-QD2UC-TE 

EC-QSAGA-TE 

EC-QLJ LB-TE 

EC-QC3EB-TE 

EC-QE18B-TE 

EC-N0647-72 

EC-QFGLB-TE 
EC-QLUAD-TE 

EC-QA4XC-TE 

EC-QHUXA-TE 

EC-QHUVB-TE 

EC-QHUWA-TE 



^To order and purchase the Alpha Architecture Rd'erence Manual, call 1-800-DIGITAL from 
the U.S. or Canada, or contact your local Digital office, or technical or reference bookstore where 
Digital Press books are distributed by Prentice Hall. 



Ordering Associated Third-Party Literature 

You can order the following third-party literature directly from the vendor: 



Title 



Vendor 



PCI Systan Design Guide 



PCI Local Bus Specification 
Revision 2.1 

IEEE Standard 754, Standard for 
Binary Floating-Point Arithmetic 



IEEE Standard 1149.1, A Test 
Access Port and Boundary Scan 
Architecture 



PCI Special I nterest Group 
1-800-433-5177 (U.S.) 
1-503-797-4207 (International) 
1-503-234-6762 (FAX) 

See previous entry. 



IEEE Service Center 

445 Hoes Lane 

P.O. Box 1331 

Piscataway NJ 08855-1331 

1-800-678-1 EEE (U.S. and Canada) 

908-562-3805 (Outside U.S. and Canada) 



See previous entry. 



